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(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DORSEY & WHITNEY LLP 
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(C) CITY: SAN FRANCISCO 
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(v) COMPUTER READABLE FORM: 
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(C) OPERATING SYSTEM: PC-DOS/MS-DOS 
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(viii) ATTORNEY/AGENT INFORMATION: 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GGATCCACAG NNN 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

ATGGGCGATG CCGGCGTCGC GTTCACAG 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

ATGGAGGATG CTGGAGTGGC GTCACAG 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

ATGAGCGAGG CCGGCGTCGC GTCACAG 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
CCATCGATGC CGGACTGGTA TCCCAGGGGG 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

CCATCGATGC CGGACTGGTA TCCCGAGGGA C 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 



CCATCGATGA TCCAGCCTCC TCGCGGCGCC GGATGGGCA 39 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GCTCTAGATC CATTCGCCAT CCGAAGATGC CCATCCGGC 39 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 39 base pairs 
■(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CCATCGATTT ATGCCGAGAA GGTAACCAGA GAAACACAC 39 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
(Cy STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

- GCTCTAGACC AGGTAATATA CCACAACGTG TGTTTCTCT 39 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGGGGGAATT CATTTAGGTG ACACTATAGT TCTGCCTCCC CGGAC 4 5 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
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GGGGGGATCC TGGTATCCCA GGGGGGC 

(2) INFORMATION FOR SEQ ID NO: 13: . 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CCGGAAGCTT GTTTTTCTTT CTTTACCA 

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GGGGGATCCG ATGGTATCCC GAGGGACGCT CAGCAGGTGG CATAGG 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 
'(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

AAATAATTTT GT TACT T TAG AAGGAGATAT ACATATGAGC GAGCGAGCAC AC 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

AAATAATTTT GTTTAACCTT AAGAAGGAGA TCTACATATG CTGGAGTGGC GTCAC 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGAGATCTAC ATATGGGAGA TGCTGGAGTG 



(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

GTAGCGAACG TCGAGAA 

(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

GGGGGATCCT CAGTTGTCAG TGGCGGGGTA G 

(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
GGGGATCCCT AATTGGCACG AGCGGCGC 



(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

AATTACATAT GGCGGCCGCC GTTTCTGCC 

(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
AATTACATAT GTTCGCGGCC GCCGTTTCT 
(2) INFORMATION FOR SEQ ID NO: 23: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Phe Ala Ala Ala Val Ser Ala Phe Ala Ala Asn Met Leu Ser Ser Val 
15 10 15 

Leu Lys Ser 



(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Pro Thr Leu Val Asp Gin Gly Phe Trp lie Gly Gly Gin Tyr Ala Leu 
15 10 15 

Thr Pro Thr Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Phe Ala Ala Ala Val Ser 
1 5 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GCGCCCCCUG GGAUACCAGG AUC 23 

(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



TCAGCAGGTG GCATAGG 



17 



(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 6.. 32 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

CCCAT ATG GGC GAT GCC GGC GTC GCG TCA CAG 32 
Met Gly Asp Ala Gly Val Ala Ser Gin 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Gly Asp Ala Gly Val Ala Ser Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
"(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 6.. 32 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CCCAT ATG AGC GAG GCC GGC GTC GCG TCA CAG 32 
Met Ser Glu Ala Gly Val Ala Ser Gin 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein - N-terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Ser Glu Ala Gly Val Ala Ser Gin 
1 5 



1 



5 



1 



5 
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(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

ATG GGA GAT GCT GGA GTG GCG TCA CAG 27 
Met Gly Asp Ala Gly Val Ala Ser Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Met Gly Asp Ala Gly Val Ala Ser Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

GGGGGATCCC GCGGATTTAT GAGCGAG . 27 

(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs' • 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GGGGGATCCC GCGGAGACAT GAGCGAGCAC AC 32 

(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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• 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

GGGGGATCCA GCGACATGAG AGATGCTGGA GTGG 34 

(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GGGGGATCCA GCGACATGAG AGATGCTGGA GTGG 34 



(2) INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

GGGGGATCCG TTCTGCCTCC CCGGAC 26 

(2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 37. .5145 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACA ATG TAC GCG AAA GCG ACA 54 

Met Tyr Ala Lys Ala Thr 
1 5 
GAT GTC GCC TAC GCG AAC GTA 102 
Asp Val Ala Tyr Ala Asn Val 
20 

TTC GCC CCG CCA CTG AAG GCA 150 
Phe Ala Pro Pro Leu Lys Ala 
35 

CCG CTG CGC TTC AAA GGG GGC 198 
Pro Leu Arg Phe Lys Gly Gly 
50 

CTG GCC GGG CAC CAA CGT GTC 24 6 

Leu Ala Gly His Gin Arg Val 
65 70 
AGG GGA CGT AGC ACA GTG CTC 294 
Arg Gly Arg Ser Thr Val Leu 
80 85 



GAC GTG GCG 
Asp Val Ala 

CTG CAG CAG 
Leu Gin Gin 
25 

CTA GAA ACC 
Leu Glu Thr 
40 

ACT TTA CCC 
Thr Leu Pro 
55 

GCA GAA GAG 
Ala Glu Glu 



CGT GTC TAC 
Arg Val Tyr 
10 

AGA GCA GTC 
Arg Ala Val 

CTC CAC AGA 
Leu His Arg 

CCG ACA CAA 
Pro Thr Gin 
60 

GTT CTG CAC 
Val Leu His 
75 



GCC GCG GCA 
Ala Ala Ala 
15 

AAG TTG GAC 
Lys Leu Asp 
30 

CTG TAC TAT 
Leu Tyr Tyr 
45 

CAC CCG ATC 
His Pro lie 

AAT TTC GCC 
Asn Phe Ala 
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GAG 


ATA 


GGG 


CCG 


TCT 


CTG 


CAC 


AGC 


GCA 


CTT 


AAG 


CTA 


CAT 


GGG 


GCA 


CCG 


342 


Glu 


He 


Gly 


Pro 


Ser 


Leu 


His 


Ser 


Ala 


Leu 


Lys 


Leu 


His 


Gly 


Ala 


Pro 










90 










95 










100 








AAC 


GCC 


CCC 


GTC 


GCA 


GAC 


TAT 


CAC 


GGG 


TGC 


ACC 


AAG 


TAC 


GGC 


ACC 


CGC 


390 


Asn 


Ala 


Pro 


Val 


Ala 


Asp 


Tyr 


His 


Gly 


Cys 


Thr 


Lys 


Tyr 


Gly 


Thr 


Arg 








105 










110 










115 










GAC 


GGC 


TCG 


CGA 


CAC 


ATT 


ACG 


GCC 


TTA 


GAG 


TCT 


AGA 


TCC 


GTC 


GCC 


ACA 


438 


Asp 


Gly 


Ser 


Arg 


His 


He 


Thr 


Ala 


Leu 


Glu 


Ser 


Arg 


Ser 


Val 


Ala 


Thr 






120 










125 










130 












GGC 


CGG 


CCC 


GAG 


TTC 


AAG 


GCC 


GAC 


GCC 


TCA 


CTG 


CTC 


GCC 


AAC 


GGC 


ATT 


486 


Gly 


Arg 


Pro 


Glu 


Phe 


Lys 


Ala 


Asp 


Ala 


Ser 


Leu 


Leu 


Ala 


Asn 


Gly 


He 




135 










140 










145 










150 




GCC 


TCC 


CGC 


ACC 


TTC 


TGC 


GTC 


GAC 


GGA 


GTC 


GGC 


TCT 


TGC 


GCG 


TTC 


AAA 


534 


Ala 


Ser 


Arg 


Thr 


Phe 


Cys 


Val 


Asp 


Gly 


Val 


Gly 


Ser 


Cys 


Ala 


Phe 


Lys 












155 










160 










165 






TCG 


CGC 


GTT 


GGA 


ATT 


GCC 


AAT 


CAC 


TCC 


CTC 


TAT 


GAC 


GTG 


ACC 


CTA 


GAG 


582 


Ser 


Arg 


Val 


Gly 


He 


Ala 


Asn 


His 


Ser 


Leu 


Tyr 


Asp 


Val 


Thr 


Leu 


Glu 










170 










175 










180 








GAG 


CTG 


GCC 


AAT 


GCG 


TTT 


GAG 


AAC 


CAC 


GGA 


CTT 


CAC 


ATG 


GTC 


CGC 


GCG 


630 


Glu 


Leu 


Ala 


Asn 


Ala 


Phe 


Glu 


Asn 


His 


Gly 


Leu 


His 


Met 


Val 


Arg 


Ala 








185 










190 










195 










TTC 


ATG 


CAC 


ATG 


CCA 


GAA 


GAG 


CTG 


CTC 


TAC 


ATG 


GAC 


AAC 


GTG 


GTT 


AAT 


678 


Phe 


Met 


His 


Met 


Pro 


Glu 


Glu 


Leu 


Leu 


Tyr 


Met 


Asp 


Asn 


Val 


Val 


Asn 






200 










205 










210 












GCC 


GAG 


CTC 


GGC 


TAC 


CGC 


TTC 


CAC 


GTT 


ATT 


GAA 


GAG 


CCT 


ATG 


GCT 


GTG 


726 


Ala 


Glu 


Leu 


Gly 


Tyr 


Arg 


Phe 


His 


Val 


He 


Glu 


Glu 


Pro 


Met 


Ala 


Val 




215 










220 










225 










230 




AAG 


GAC 


TGC 


GCA 


TTC 


CAG 


GGG 


GGG 


GAC 


CTC 


CGT 


CTC 


CAC 


TTC 


CCT 


GAG 


774 


Lys 


Asp 


Cys 


Ala 


Phe 


Gin 


Gly 


Gly 


Asp 


Leu 


Arg 


Leu 


His 


Phe 


Pro 


Glu 












235 










240 










245 






TTG 


GAC 


TTC 


ATC 


AAC 


GAG 


AGC 


CAA 


GAG 


CGG 


CGC 


ATC 


GAG 


AGG 


CTG 


GCC 


822 


Leu 


Asp 


Phe 


He 


Asn 


Glu 


Ser 


Gin 


Glu 


Arg 


Arg 


He 


Glu 


Arg 


Leu 


Ala 










250 










255 










260 








GCC 


CGC 


GGC 


TCC 


TAC 


TCC 


AGA 


CGC 


GCC 


GTC 


ATT 


TTC 


TCC 


GGC 


GAC 


GAC 


870 


Ala 


Arg 


Gly 


Ser 


Tyr 


Ser 


Arg 


Arg 


Ala 


Val 


He 


Phe 


Ser 


Gly 


Asp 


Asp 








265 










270 










275 










GAC 


TGG 


GGT 


GAT 


GCG 


TAC 


TTA 


CAC 


GAC 


TTC 


CAC 


ACA 


TGG 


CTC 


GCC 


TAC 


918 


Asp 


Trp 


Gly 


Asp 


Ala 


Tyr 


Leu 


His 


Asp 


Phe 


His 


Thr 


Trp 


Leu 


Ala 


Tyr 






280 










285 










290 












CTA 


CTG 


GTG 


AGG 


AAC 


TAC 


CCC 


ACT 


CCG 


TTT 


GGT 


TTC 


TCA 


CTC 


CAT 


ATA 


966 


Leu 


Leu 


Val 


Arg 


Asn 


Tyr 


Pro 


Thr 


Pro 


Phe 


Gly 


Phe 


Ser 


Leu 


His 


He 




295 










300 










305 










310 




GAA 


GTC 


CAG 


AGG 


CGC 


CAC 


GGC 


TCC 


AGC 


ATT 


GAG 


CTG 


CGC 


ATC 


ACT 


CGC 


1014 


Glu 


Val 


Gin 


Arg 


Arg 


His 


Gly 


Ser 


Ser 


He 


Glu 


Leu 


Arg 


He 


Thr 


Arg 












315 










320 










325 






GCG 


CCA 


CCT 


GGA 


GAC 


CGC 


ATG 


CTG 


GCC 


GTC 


GTC 


CCA 


AGG 


ACG 


TCC 


CAA 


1062 


Ala 


Pro 


Pro 


Gly 


Asp 


Arg 


Met 


Leu 


Ala 


Val 


Val 


Pro 


Arg 


Thr 


Ser 


Gin 










330 










335 










340 








GGC 


CTC 


TGC 


AGA 


ATC 


CCA 


AAC 


ATC 


TTT 


TAT 


TAC 


GCC 


GAC 


GCG 


TCG 


GGC 


1110 


Gly 


Leu 


Cys 


Arg 


He 


Pro 


Asn 


He 


Phe 


Tyr 


Tyr 


Ala 


Asp 


Ala 


Ser 


Gly 








345 










350 










355 










ACT 


GAG 


CAT 


AAG 


ACC 


ATC 


CTT 


ACG 


TCA 


CAG 


CAC 


AAA 


GTC 


AAC 


ATG 


CTG 


1158 


Thr 


Glu 


His 


Lys 


Thr 


He 


Leu 


Thr 


Ser 


Gin 


His 


Lys 


Val 


Asn 


Met 


Leu 






360 










365 










370 












CTC 


AAT 


TTT 


ATG 


CAA 


ACG 


CGT 


CCT 


GAG 


AAG 


GAA 


CTA 


GTC 


GAC 


ATG 


ACC 


1206 


Leu 


Asn 


Phe 


Met 


Gin 


Thr 


Arg 


Pro 


Glu 


Lys 


Glu 


Leu 


Val 


Asp 


Met 


Thr 




375 










380 










385 










390 
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GTC TTG ATG TCG TTC GCG CGC GCT AGG CTG CGC GCG ATC GTG GTC GCC 1254 
Val Leu Met Ser Phe Ala Arg Ala Arg Leu Arg Ala lie Val Val Ala 

395 400 405 

TCA GAA GTC ACC GAG AGC TCC TGG AAC ATC TCA CCG GCT GAC CTG GTC 1302 
Ser Glu Val Thr Glu Ser Ser Trp Asn lie Ser Pro Ala Asp Leu Val 

410 415 420 

CGC ACT GTC GTG TCT CTT TAC GTC CTC CAC ATC ATC GAG CGC CGA AGG 1350 
Arg Thr Val Val Ser Leu Tyr Val Leu His lie lie Glu Arg Arg Arg 

425 430 435 

GCT GCG GTC GCT GTC AAG ACC GCC AAG GAC GAC GTC TTT GGA GAG ACT 1398 
Ala Ala Val Ala Val Lys Thr Ala Lys Asp Asp Val Phe Gly Glu Thr 

440 445 450 

TCG TTC TGG GAG AGT CTC AAG CAC GTC TTG GGC TCC TGT TGC GGT CTG 14 4 6 

Ser Phe Trp Glu Ser Leu Lys His Val Leu Gly Ser Cys Cys Gly Leu 
455 460 465 470 

CGC AAC CTC AAA GGC ACC GAC GTC GTC TTT ACT AAG CGC GTC GTC GAT 14 94 

Arg Asn Leu Lys Gly Thr Asp Val Val Phe Thr Lys Arg Val Val Asp 

475 480 485 

AAG TAC CGA GTC CAC TCG CTC GGA GAC ATA ATC TGC GAC GTC CGC CTG 1542 
Lys Tyr Arg Val His Ser Leu Gly Asp lie lie Cys Asp Val Arg Leu 

490 495 500 

TCC CCT GAA CAG GTC GGC TTC CTG CCG TCC CGC GTA CCA CCT GCC CGC 1590 
Ser Pro Glu Gin Val Gly Phe Leu Pro Ser Arg Val Pro^ Pro Ala Arg 

505 510 515 

GTC TTT CAC GAC AGG GAA GAG CTT GAG GTC CTT CGC GAA GCT GGC TGC 1638 
Val Phe His Asp Arg Glu Glu Leu Glu Val Leu Arg Glu Ala Gly Cys 

520 525 530 

TAC AAC GAA CGT CCG GTA CCT TCC ACT CCT CCT GTG GAG GAG CCC CAA 168 6 

Tyr Asn Glu Arg Pro Val Pro Ser Thr Pro Pro Val Glu Glu Pro Gin 
535 540 545 550 

GGT TTC GAC GCC GAC TTG TGG CAC GCG ACC GCG GCC TCA CTC CCC GAG 1734 
Gly Phe Asp Ala Asp Leu Trp His Ala Thr Ala Ala Ser Leu Pro Glu 

555 560 565 

TAC CGC GCC ACC TTG CAG GCA GGT CTC AAC ACC GAC GTC AAG CAG CTC 17 82 

Tyr Arg Ala Thr Leu Gin Ala Gly Leu Asn Thr Asp Val Lys Gin Leu 

570 575 580 

AAG ATC ACC CTC GAG AAC GCC CTC AAG ACC ATC GAC GGG CTC ACC CTC 1830 
Lys lie Thr Leu Glu Asn Ala Leu Lys Thr lie Asp Gly Leu Thr Leu 

585 590 595 

TCC CCA GTC AGA GGC CTC GAG ATG TAC GAG GGC CCG CCA GGC AGC GGC 187 8 

Ser Pro Val Arg Gly Leu Glu Met Tyr Glu Gly Pro Pro Gly Ser Gly 

600 605 610 

AAG ACG GGC ACC CTC ATC GCC GCC CTT GAG GCC GCG GGC GGT AAA GCA 1926 
Lys Thr Gly Thr Leu lie Ala Ala Leu Glu Ala Ala Gly Gly Lys Ala 
615 620 625 630 

CTT TAC GTG GCA CCC ACC AGA GAA CTG AGA GAG GCT ATG GAC CGG CGG 197 4 

Leu Tyr Val Ala Pro Thr Arg Glu Leu Arg Glu Ala Met Asp Arg Arg 

635 640 645 

ATC AAA CCG CCG TCC GCC TCG GCT ACG CAA CAT GTC GCC CTT GCG ATT 2022 
lie Lys Pro Pro Ser Ala Ser Ala Thr Gin His Val Ala Leu Ala lie 

650 655 . 660 

CTC CGT CGT GCC ACC GCC GAG GGC GCC CCT TTC GCT ACC GTG GTT ATC 2070 
Leu Arg Arg Ala Thr Ala Glu Gly Ala Pro Phe Ala Thr Val Val He 

665 670 675 

GAC GAG TGC TTC ATG TTC CCG CTC GTG TAC GTC GCG ATC GTG CAC GCC 2118 
Asp Glu Cys Phe Met Phe Pro Leu Val Tyr Val Ala He Val His Ala 
680 685 690 
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TTG TCC CCG AGC TCA CGA ATA GTC CTT GTA GGG GAC GTC CAC CAA ATC 2166 

Leu Ser Pro Ser Ser Arg lie Val Leu Val Gly Asp Val His Gin lie 

695 700 705 710 

GGG TTT ATA GAC TTC CAA GGC ACA AGC GCG AAC ATG CCG CTC GTT CGC 2214 

Gly Phe lie Asp Phe Gin Gly Thr Ser Ala Asn Met Pro Leu Val Arg 

715 720 725 

GAC GTC GTT AAG CAG TGC CGT CGG CGC ACT TTC AAC CAA ACC AAG CGC 2262 
Asp Val Val Lys Gin Cys Arg Arg Arg Thr Phe Asn Gin Thr Lys Arg 

730 735 740 

TGT CCG GCC GAC GTC GTT GCC ACC ACG TTT TTC CAG AGC TTG TAC CCC 2310 
Cys Pro Ala Asp Val Val Ala Thr Thr Phe Phe Gin Ser Leu Tyr Pro 

745 750 755 

GGG TGC ACA ACC ACC TCA GGG TGC GTC GCA TCC ATC AGC CAC GTC GCC 2358 
Gly Cys Thr Thr Thr Ser Gly Cys Val Ala Ser lie Ser His Val Ala 

760 765 770 

CCA GAC TAC CGC AAC AGC CAG GCG CAA ACG CTC TGC TTC ACG CAG GAG 24 06 

Pro Asp Tyr Arg Asn Ser Gin Ala Gin Thr Leu Cys Phe Thr Gin Glu 
775 780 785 790 

GAA AAG TCG CGC CAC GGG GCT GAG GGC GCG ATG ACT GTG CAC GAA GCG 24 54 

Glu Lys Ser Arg His Gly Ala Glu Gly Ala Met Thr Val His Glu Ala 

795 800 805 

CAG GGA CGC ACT TTT GCG TCT GTC ATT CTG CAT TAC AAC GGC TCC ACA 2502 
Gin Gly Arg Thr Phe Ala Ser Val lie Leu His Tyr Asn Gly Ser Thr 

810 815 820 

GCA GAG CAG AAG CTC CTC GCT GAG AAG TCG CAC CTT CTA GTC GGC ATC 2550 
Ala Glu Gin Lys Leu Leu Ala Glu Lys Ser His Leu Leu Val Gly lie 

825 830 835 

ACG CGC CAC ACC AAC CAC CTG TAC ATC CGC GAC CCG ACA GGT GAC ATT 2598 
Thr Arg His Thr Asn His Leu Tyr lie Arg Asp Pro Thr Gly Asp lie 

840 845 850 

GAG AGA CAA CTC AAC CAT AGC GCG AAA GCC GAG GTG TTT ACA GAC ATC 2 64 6 

Glu Arg Gin Leu Asn His Ser Ala Lys Ala Glu Val Phe Thr Asp lie 
855 ^ 860 865 870 

CCT GCA CCC CTG GAG ATC ACG ACT GTC AAA CCG AGT GAA GAG GTG CAG 2 694 

Pro Ala Pro Leu Glu lie Thr Thr Val Lys Pro Ser Glu Glu Val Gin 

875 880 885 

CGC AAC GAA GTG ATG GCA ACG ATA CCC CCG CAG AGT GCC ACG CCG CAC 27 42 

Arg Asn Glu Val Met Ala Thr lie Pro Pro Gin Ser Ala Thr Pro His 

890 895 900 

GGA GCA ATC CAT CTG CTC CGC AAG AAC TTC GGG GAC CAA CCC GAC TGT 27 90 

Gly Ala lie His Leu Leu Arg Lys Asn Phe Gly Asp Gin Pro Asp Cys 

905 910 915 

GGC TGT GTC GCT TTG GCG AAG ACC GGC TAC GAG GTG TTT GGC GGT CGT 2838 
Gly Cys Val Ala Leu Ala Lys Thr Gly Tyr Glu Val Phe Gly Gly Arg 

920 925 930 

GCC AAA ATC AAC GTA GAG CTT GCC GAA CCC GAC GCG ACC CCG AAG CCG 288 6 

Ala Lys lie Asn Val Glu Leu Ala Glu Pro Asp Ala Thr Pro Lys Pro 
935 940 945 950 

CAT AGG GCG TTC CAG GAA GGG GTA CAG TGG GTC AAG GTC ACC AAC GCG 2934 
His Arg Ala Phe Gin Glu Gly Val Gin Trp Val Lys Val Thr Asn Ala 

955 960 965 

TCT AAC AAA CAC CAG GCG CTC CAG ACG CTG TTG TCC CGC TAC ACC AAG 2982 
Ser Asn Lys His Gin Ala Leu Gin Thr Leu Leu Ser Arg Tyr Thr Lys 

970 975 980 

CGA AGC GCT GAC CTG CCG CTA CAC GAA GCT AAG GAG GAC GTC AAA CGC 3030 
Arg Ser Ala Asp Leu Pro Leu His Glu Ala Lys Glu Asp Val Lys Arg 

985 990 995 
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ATG CTA AAC TCG CTT GAC CGA CAT TGG GAC TGG ACT GTC ACT GAA GAC 3078 
Met Leu Asn Ser Leu Asp Arg His Trp Asp Trp Thr Val Thr Glu Asp 

1000 1005 1010 

GCC CGT GAC CGA GCT GTC TTC GAG ACC CAG CTC AAG TTC ACC CAA CGC 312 6 

Ala Arg Asp Arg Ala Val Phe Glu Thr Gin Leu Lys Phe Thr Gin Arg 
1015 1020 1025 1030 

GGC GGC ACC GTC GAA GAC CTG CTG GAG CCA GAC GAC CCC TAC ATC CGT 317 4 

Gly Gly Thr Val Glu Asp Leu Leu Glu Pro Asp Asp Pro Tyr lie Arg 

1035 1040 1045 

GAC ATA GAC TTC CTT ATG AAG ACT CAG CAG AAA GTG TCG CCC AAG CCG 3222 
Asp lie Asp Phe Leu Met Lys Thr Gin Gin Lys Val Ser Pro Lys Pro 

1050 1055 1060 

ATC AAT ACG GGC AAG GTC GGG CAG GGG ATC GCC GCT CAC TCA AAG TCT 3270 
He Asn Thr Gly Lys Val Gly Gin Gly He Ala Ala His Ser Lys Ser 

1065 1070 1075 

CTC AAC TTC GTC CTC GCC GCT TGG ATA CGC ATA CTC GAG GAG ATA CTC 3318 
Leu Asn Phe Val Leu Ala Ala Trp He Arg He Leu Glu Glu He Leu 

1080 1085 1090 

CGT ACC GGG AGC CGC ACG GTC CGG TAC AGC AAC GGT CTC CCC GAC GAA 3366 
Arg Thr Gly Ser Arg Thr Val Arg Tyr Ser Asn Gly Leu Pro Asp Glu 
1095 1100 1105 1110 

GAA GAG GCC ATG CTG CTC GAA GCG AAG ATC AAT CAA GTC CCA CAC GCC 3414 
Glu Glu Ala Met Leu Leu Glu Ala Lys He Asn Gin Val Pro His Ala 

1115 1120 1125 

ACG TTC GTC TCG GCG GAC TGG ACC GAG TTT GAC ACC GCC CAC AAT AAC 34 62 

Thr Phe Val Ser Ala Asp Trp Thr Glu Phe Asp Thr Ala His Asn Asn 

1130 1135 1140 

ACG AGT GAG CTG CTC TTC GCC GCC CTT TTA GAG CGC ATC GGC ACG CCT 3510 
Thr Ser Glu Leu Leu Phe Ala Ala Leu Leu Glu Arg He Gly Thr Pro 

1145 1150 1155 

GCA GCT GCC GTT AAT CTA TTC AGA GAA CGG TGT GGG AAA CGC ACC TTG 3558 
Ala Ala Ala Val Asn Leu Phe Arg Glu Arg Cys Gly Lys Arg Thr Leu 

1160 1165 1170 

CGA GCG AAG GGT CTA GGC TCC GTT GAA GTC GAC GGT CTG CTC GAC TCC 3606 
Arg Ala Lys Gly Leu Gly Ser Val Glu Val Asp Gly Leu Leu Asp Ser 
1175 1180 1185 1190 

GGC GCA GCT TGG ACG CCT TGC CGC AAC ACC ATC TTC TCT GCC GCC GTC 3654 
Gly Ala Ala Trp Thr Pro Cys Arg Asn Thr He Phe Ser Ala Ala Val 

1195 1200 1205 

ATG CTC ACG CTC TTC CGC GGC GTC AAG TTC GCA GCT TTC AAA GGC GAC 37 02 

Met Leu Thr Leu Phe Arg Gly Val Lys Phe Ala Ala Phe Lys Gly Asp 

1210 1215 1220 

GAC TCG CTC CTC TGT GGT AGC CAT TAC CTC CGT TTC GAC GCT AGC CGC 37 50 

Asp Ser Leu Leu Cys Gly Ser His Tyr Leu Arg Phe Asp Ala Ser Arg 

1225 1230 1235 

CTT CAC ATG GGC GAA CGT TAC AAG ACC AAA CAT TTG AAG GTC GAG GTG 37 98 

Leu His Met Gly Glu Arg Tyr Lys Thr Lys His Leu Lys Val Glu Val 

1240 1245 1250 

CAG AAA ATC GTG CCG TAC ATC GGA CTC CTC GTC TCC GCT GAG CAG GTC 384 6 

Gin Lys He Val Pro Tyr He Gly Leu Leu Val Ser Ala Glu Gin Val 
1255 1260 1265 1270 

GTC CTC GAC CCT GTC AGG AGC GCT CTC AAG ATA TTT GGG CGC TGC TAC 38 94 

Val Leu Asp Pro Val Arg Ser Ala Leu Lys He Phe Gly Arg Cys Tyr 

1275 1280 1285 

ACA AGC GAA CTC CTT TAC TCC AAG TAC GTG GAG GCT GTG AGA GAC ATC 3942 
Thr Ser Glu Leu Leu Tyr Ser Lys Tyr Val Glu Ala Val Arg Asp He 
1290 1295 1300 
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ACC AAG GGC TGG AGT GAG GCC CGC TAC CAC AGC CTC CTG TGC CAC ATG 3990 
Thr Lys Gly Trp Ser Asp Ala Arg Tyr His Ser Leu Leu Cys His. Met 

1305 1310 1315 

TCA GCA TGC TAC TAC AAT TAC GCG CCG GAG TCT GCG GCG TAC ATC ATC 4 038 

Ser Ala Cys Tyr Tyr Asn Tyr Ala Pro Glu Ser Ala Ala Tyr lie lie 

1320 1325 1330 

GAC GCT GTT GTT CGC TTT GGG CGC GGC GAC TTC CCG TTT GAA CAA CTG 4 08 6 

Asp Ala Val Val Arg Phe Gly Arg Gly Asp Phe Pro Phe Glu Gin Leu 
1335 1340 1345 • 1350- 

CGC GTG GTG CGT GCC CAT GTG CAG GCA CCC GAC GCT TAC AGC AGC ACG 4134 
Arg Val Val Arg Ala His Val Gin Ala Pro Asp Ala Tyr Ser Ser Thr 

1355 1360 1365 

TAT CCG GCT AAC GTG CGC GCA TCG TGC CTT GAC CAC GTC TTC GAG CCC 4182 
Tyr Pro Ala Asn Val Arg Ala Ser Cys Leu Asp His Val Phe Glu Pro 

1370 1375 1380 

CGC CAG GCC GCC GCC CCG GCA GGT TTC GTT GCG ACA TGT GCG AAG CCG 4230 
Arg Gin Ala Ala Ala Pro Ala Gly Phe Val Ala Thr Cys Ala Lys Pro 

1385 1390 1395 

GAA ACG CCT TCT TCA CTT ACC GCG AAA GCT GGT GTT TCT GCG ACT ACA 4 278" 

Glu Thr Pro Ser Ser Leu Thr Ala Lys Ala Gly Val Ser Ala Thr Thr 

1400 1405 1410 

AGC CAC GTT GCG ACT GGG ACT GCG CCC CCG GAG TCT CCA TGG GAT GCA 4 326 

Ser His Val Ala Thr Gly Thr Ala Pro Pro Glu Ser Pro Trp Asp Ala 
1415 1420 1425 1430 . 

CCT GCA GCC AAC AGC TTT TCG GAG TTA TTG ACA CCG GAG ACC CCG TCC 4 37 4 

Pro Ala Ala Asn Ser Phe Ser Glu Leu Leu Thr Pro Glu Thr Pro Ser 

1435 1440 1445 

ACA TCA TCC TCG CCG TCA TCG TCT TCA TCG GAC TCC TCT ACA TCG TGT 4 422 

Thr Ser Ser Ser Pro Ser Ser Ser Ser Ser Asp Ser Ser Thr Ser Cys 

1450 1455 1460 

GGA AGG TCG CTC AGT GGT GGA GAC ACC GCA AGG ACC ACA GAA GAC TTG 4 47 0 

Gly Arg Ser Leu Se^r Gly Gly Asp Thr Ala Arg Thr Thr Glu Asp Leu 

1465 1470 1475 

AAC AGC AGA AAG CCG CCT TCG CAA GAC AGG CAA TCA CGC TCG TCT GAA 4 518 

Asn Ser Arg Lys Pro Pro Ser Gin Asp Arg Gin Ser Arg Ser Ser Glu 

1480 1485 1490 

TGT CTG GAC AGA AGC GGA GAA AGG ACA GGC AGT TCG TTA ACT GCC CCC 4 566 

Cys Leu Asp Arg Ser Gly Glu Arg Thr Gly Ser Ser Leu Thr Ala Pro 
1495 1500 1505 1510 

ACT GCT CCG AGC CCC TCA TTC TCA TTT TCG GAA AGA GCT CGA CTG GCG 4 614 

Thr Ala Pro Ser Pro Ser Phe Ser Phe Ser Glu Arg Ala Arg Leu Ala 

1515 1520 1525 

ACC GGG CCG ACT GTC GCC GCT GCG ACA TCA CCT TCG GCA ACC CCA TCC 4 662 

Thr Gly Pro Thr Val Ala Ala Ala Thr Ser Pro Ser Ala Thr Pro Ser 

1530 1535 1540 

TGC GCC ACG GAC CAG GTT GCC GCG AGG ACC ACG CCG GAC TTT GCG CCT 4 710 

Cys Ala Thr Asp Gin Val Ala Ala Arg Thr Thr Pro Asp Phe Ala Pro 

1545 1550 1555 

TTC CTG GGT TCC CAG TCT GCC CGT GCT GTC TCG AAG CCG TAC CGG CCC 4 758 

Phe Leu Gly Ser Gin Ser Ala Arg Ala Val Ser Lys Pro Tyr Arg Pro 

1560 1565 1570 

CCC ACG ACT GCC CGT TGG AAA GAA GTC ACC CCG CTC CAC GCG TGG AAG 4 806 

Pro Thr Thr Ala Arg Trp Lys Glu Val Thr Pro Leu His Ala Trp Lys 
1575 1580 1585 1590 

GGC GTG ACC GGA GAC CGA CCG GAA GTC AGG GAG GAC CCG GAG ACA GCG 4 854 

Gly Val Thr Gly Asp Arg Pro Glu Val Arg Glu Asp Pro Glu Thr Ala 
1595 1600 1605 
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GCG GTC GTC CAG GCT CTG ATC AGC GGC CGT TAT CCT CAG AAG ACG AAG 4 902 

Ala Val Val Gin Ala Leu lie Ser Gly Arg Tyr Pro Gin Lys Thr Lys 

1610 1615 1620 

CTT TCC TCC GAC GCA TCC AAA GGC TAC TCA AGA ACT AAG GGA TGC TCA 4 950 

Leu Ser Ser Asp Ala Ser Lys Gly Tyr Ser Arg Thr Lys Gly Cys Ser 

1625 1630 1635 

CAA TCC ACC TCT TTT CCT GCC CCG AGT GCG GAT TAC CAG GCC CGC GAC 4 998 

Gin Ser Thr Ser Phe Pro Ala Pro Ser Ala Asp Tyr Gin Ala Arg Asp 

1640 1645 1650 

TGC CAG ACA GTC CGA GTC TGC CGC GCC GCT GCA GAG ATG GCG CGC TCA 504 6 

Cys Gin Thr Val Arg Val Cys Arg Ala Ala Ala Glu Met Ala Arg Ser 
1655 1660 1665 1670 

TGT ATT CAC GAG CCG TTG GCT TCA TCT GCC GCC AGT GCC GAC TTG AAG 5094 
Cys lie His Glu Pro Leu Ala Ser Ser Ala Ala Ser Ala Asp Leu Lys 

1675 1680 1685 

CGC ATA CGC TCT ACC TCG GAC TCT GTT CCC GAT GTA AAG ATC AGC AAG 514 2 

Arg lie Arg Ser Thr Ser Asp Ser Val Pro Asp Val Lys lie Ser Lys 

1690 1695 1700 

AGC GCA TGAAGGAACA AAATTAGTTT CCTTGTTCGT AAACAAGGTG GTCCCTCCCA 5198 
Ser Ala 

TTGAGGTAAA GACTCTGGTG AGTCCTCAAC GTTACTCGTT GAGTCTGCTG CGGTTCGATT 5258 
CCATTCCCAA GCAGCAAAGG GTGCGCAACT AGTACGGCGC CCCCTGGGAT ACCA 5312 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1704 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



Met Tyr Ala Lys 
1 

Val Ala Tyr Ala 
20 

Ala Pro Pro Leu 
35 

Leu Arg Phe Lys 
50 

Ala Gly His Gin 
65 

Gly Arg Ser Thr 

Lys Leu His Gly 
100 

Thr Lys Tyr Gly 
115 

Ser Arg Ser Val 
130 

Leu Leu Ala Asn 
145 

Gly Ser Cys Ala 

Tyr Asp Val Thr 
180 

Leu His Met Val 



Ala Thr Asp Val 
5 

Asn Val Leu Gin 

Lys Ala Leu Glu 
40 

Gly Gly Thr Leu 
55 

Arg Val Ala Glu 
70 

Val Leu Glu He 
85 

Ala Pro Asn Ala 

Thr Arg Asp Gly 
120 

Ala Thr Gly Arg 
135 

Gly He Ala Ser 
150 

Phe Lys Ser Arg 
165 

Leu Glu Glu Leu 
Arg Ala Phe Met 



Ala Arg Val Tyr 
10 

Gin Arg Ala Val 
25 

Thr Leu His Arg 

Pro Pro Thr Gin 
60 

Glu Val Leu His 
75 

Gly Pro Ser Leu 
90 

Pro Val Ala Asp 
105 

Ser Arg His He 

Pro Glu Phe Lys 
140 

Arg Thr Phe Cys 
155 

Val Gly He Ala 
170 

Ala Asn Ala Phe 
185 

His Met Pro Glu 



Ala Ala Ala Asp 
15 

Lys Leu Asp Phe 
30 

Leu Tyr Tyr Pro 
45 

His Pro He Leu 

Asn Phe Ala Arg 
80 

His Ser Ala Leu 
95 

Tyr His Gly Cys 
110 

Thr Ala Leu Glu 
125 

Ala Asp Ala Ser 

Val Asp Gly Val 
160 

Asn His Ser Leu 
175 

Glu Asn His Gly 
190 

Glu Leu Leu Tyr 
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195 

Met Asp Asn Val 
210 

Glu Glu Pro Met 
225 

Arg Leu His Phe 

Arg lie Glu Arg 
260 

lie Phe Ser Gly 
275 

His Thr Trp Leu 
290 

Gly Phe Ser Leu 
305 

Glu Leu Arg lie 

Val Pro Arg Thr 
340 

Tyr Ala Asp Ala 
355 

His Lys Val Asn 
370 

Glu Leu Val Asp 
385 

Arg Ala He Val 

Ser Pro Ala Asp 
420 

He He Glu Arg 
435 

Asp Val Phe Gly 
450 

Gly Ser Cys Cys 
465 

Thr Lys Arg Val 

He Cys Asp Val 
500 

Arg Val Pro Pro 
515 

Leu Arg Glu Ala 
530 

Pro Val Glu Glu 
545 

Ala Ala Ser Leu 

Thr Asp Val Lys 
580 

He Asp Gly Leu 
595 

Gly Pro Pro Gly 
610 

Ala Ala Gly Gly 
625 

Glu Ala Met Asp 



200 

Val Asn Ala Glu 
215 

Ala Val Lys Asp 
230 

Pro Glu Leu Asp 
245 

Leu Ala Ala Arg 

Asp Asp Asp Trp 
280 

Ala Tyr Leu Leu 
295 

His He Glu Val 
310 

Thr Arg Ala Pro 
325 

Ser Gin Gly Leu 

Ser Gly Thr Glu 
360 

Met Leu Leu Asn 
375 

Met Thr Val Leu 
390 

Val Ala Ser Glu 
405 

Leu Val Arg Thr 

Arg Arg Ala Ala 
440 

Glu Thr Ser Phe 
455 

Gly Leu Arg Asn 
470 

Val Asp Lys Tyr 
485 

Arg Leu Ser Pro 

Ala Arg Val Phe 
520 

Gly Cys Tyr Asn 
535 

Pro Gin Gly Phe 
550 

Pro Glu Tyr Arg 
565 

Gin Leu Lys He 

Thr Leu Ser Pro 
600 

Ser Gly Lys Thr 
615 

Lys Ala Leu Tyr 
630 

Arg Arg He Lys 
645 



Leu Gly Tyr Arg 
220 

Cys Ala Phe Gin 
235 

Phe He Asn Glu 
250 

Gly Ser Tyr Ser 
265 

Gly Asp Ala Tyr 

Val Arg Asn Tyr 
300 

Gin Arg Arg His 
315 

Pro Gly Asp Arg 
330 

Cys Arg He Pro 
345 

His Lys Thr He 

Phe Met Gin Thr 

380 

Met Ser Phe Ala 
395 

Val Thr Glu Ser 
410 

Val Val Ser Leu 
425 

Val Ala Val Lys 

Trp Glu Ser Leu 
460 

Leu Lys Gly Thr 
475 

Arg Val His Ser 
490 

Glu Gin Val Gly 
505 

His Asp Arg Glu 

Glu Arg Pro Val 
540 

Asp Ala Asp Leu 
555 

Ala Thr Leu Gin 
570 

Thr Leu Glu Asn 
585 

Val Arg Gly Leu 

Gly Thr Leu He 
620 

Val Ala Pro Thr 
635 

Pro Pro Ser Ala 
650 



205 



Phe 


His 


Val 


He 


Gly 


Gly Asp 


Leu 








240 


OCi 


Gin 


Glu 


Arg 






255 




Arg 


Arg 


Ala 


Val 




270 






Leu 


His 


Asp 


Phe 


285 








Pro 


Thr 


Pro 


Phe 


Gly 


Ser 


Ser 


He 








320 


Met 


Leu 


Ala 


Val 






335 




Asn 


He 


Phe 


Tyr 




350 






Leu 


Thr 


Ser 


Gin 


365 








Arg 


Pro 


Glu 


Lys 


Arg 


Ala 


Arg 


Leu 








400 


Ser 


Trp 


Asn 


lie 






415 




Tyr 


Val 


Leu 


His 




430 






Thr 


Ala 


Lys 


Asp 


445 








Lys 


His 


Val 


Leu 


Asp 


Val 


Val 


Phe 








480 


xj C LA 


Gly 


Asp 


He 






495 




Phe 


Leu 


Pro 


Ser 




510 






Glu 


Leu 


Glu 


Val 


525 








Pro 


Ser 


Thr 


Pro 


Trp 


His 


Ala 


Thr 








560 




Gly 


Leu 








575 




Ala 


Leu 


Lys 


Thr 




590 






Glu 


Met 


Tyr 


Glu 


605 








Ala 


Ala 


Leu 


Glu 


Arg 


Glu 


Leu 


Arg 








640 


Ser 


Ala 


Thr 


Gin 






655 





16 



His 


Val 


Ala 


Leu 


Ala 


He 


Leu 


Arg 


Arg 


Ala 


Thr 


Ala 


Glu 


Gly 


Ala 


Pro 








660 










665 










670 






Phe 


Ala 


Thr 


Val 


Val 


He 


Asp 


Glu 


Cys 


Phe 


Met 


Phe 


Pro 


Leu 


Val 


Tyr 






675 










680 










685 








Val 


Ala 


He 


Val 


His 


Ala 


Leu 


Ser 


Pro 


Ser 


Ser 


Arg 


He 


Val 


Leu 


Val 




690 










695 










700 










Gly 


Asp 


Val 


His 


Gin 


lie 


Gly 


Phe 


He 


Asp 


Phe 


Gin 


Gly 


Thr 


Ser 


Ala 


705 










710 










715 










720 


Asn 


Met 


Pro 


Leu 


Val 


Arg 


Asp 


Val 


Val 


Lys 


Gin 


Cys 


Arg 


Arg 


Arg 


Thr 










725 










730 










735 




Phe 


Asn 


Gin 


Thr 


Lys 


Arg 


Cys 


Pro 


Ala 


Asp 


Val 


Val 


Ala 


Thr 


Thr 


Phe 








740 










745 










750 






Phe 


Gin 


Ser 


Leu 


Tyr 


Pro 


Gly 


Cys 


Thr 


Thr 


Thr 


Ser 


Gly 


Cys 


Val 


Ala 






755 










760 










765 








Ser 


He 


Ser 


His 


Val 


Ala 


Pro 


Asp 


Tyr 


Arg 


Asn 


Ser 


Gin 


Ala 


Gin 


Thr 




770 










775 










780 










Leu 


Cys 


Phe 


Thr 


Gin 


Glu 


Glu 


Lys 


Ser 


Arg 


His 


Gly 


Ala 


Glu 


Gly 


Ala 


785 










790 










795 










800 


Met 


Thr 


Val 


His 


Glu 


Ala 


Gin 


Gly 


Arg 


Thr 


Phe 


Ala 


Ser 


Val 


He 


Leu 










805 










810 










815 




His 


Tyr 


Asn 


Gly 


Ser 


Thr 


Ala 


Glu 


Gin 


Lys 


Leu 


Leu 


Ala 


Glu 


Lys 


Ser 








820 










825 










830 






His 


Leu 


Leu 


Val 


Gly 


He 


Thr 


Arg 


His 


Thr 


Asn 


His 


Leu 


Tyr 


He 


Arg 






835 










840 










845 








Asp 


Pro 


Thr 


Gly 


Asp 


He 


Glu 


Arg 


Gin 


Leu 


Asn 


His 


Ser 


Ala 


Lys 


Ala 




850 










855 










860 










Glu 


Val 


Phe 


Thr 


Asp 


He 


Pro 


Ala 


Pro 


Leu 


Glu 


He 


Thr 


Thr 


Val 


Lys 


865 










870 










875 










880 


Pro 


Ser 


Glu 


Glu 


Val 


Gin 


Arg 


Asn 


Glu 


Val 


Met 


Ala 


Thr 


He 


Pro 


Pro 










885 










890 










895 




Gin 


Ser 


Ala 


Thr 


Pro 


His 


Gly 


Ala 


He 


His 


Leu 


Leu 


Arg 


Lys 


Asn 


Phe 








900 










905 










910 






Gly 


Asp 


Gin 


Pro 


Asp 


Cys 


Gly 


Cys 


Val 


Ala 


Leu 


Ala 


Lys 


Thr 


Gly 


Tyr 






915 










920 










925 








Glu 


Val 


Phe 


Gly 


Gly 


Arg 


Ala 


Lys 


He 


Asn 


Val 


Glu 


Leu 


Ala 


Glu 


Pro 




930 










935 










940 










Asp 


Ala 


Thr 


Pro 


Lys 


Pro 


His 


Arg 


Ala 


Phe 


Gin 


Glu 


Gly 


Val 


Gin 


Trp 


945 










950 










955 










960 


Val 


Lys 


Val 


Thr 


Asn 


Ala 


Ser 


Asn 


Lys 


His 


Gin 


Ala 


Leu 


Gin 


Thr 


Leu 










965 










970 










975 




Leu 


Ser 


Arg 


Tyr 


Thr 


Lys 


Arg 


Ser 


Ala 


Asp 


Leu 


Pro 


Leu 


His 


Glu 


Ala 








980 










985 










990 






Lys 


Glu 


Asp 


Val 


Lys 


Arg 


Met 


Leu 


Asn 


Ser 


Leu 


Asp 


Arg 


His 


Trp 


Asp 






995 










1000 








1005 






Trp 


Thr 


Val 


Thr 


Glu 


Asp 


Ala 


Arg 


Asp 


Arg 


Ala 


Val 


Phe 


Glu 


Thr 


Gin 




1010 








1015 








1020 








Leu 


Lys 


Phe 


Thr 


Gin 


Arg 


Gly 


Gly 


Thr 


Val 


Glu 


Asp 


Leu 


Leu 


Glu 


Pro 


1025 








1030 








1035 








104! 


Asp 


Asp 


Pro 


Tyr 


He 


Arg 


Asp 


He 


Asp 


Phe 


Leu 


Met 


Lys 


Thr 


Gin 


Gin 










104 J 










1050 








1055 


Lys 


Val 


Ser 


Pro 


Lys 


Pro 


He 


Asn 


Thr 


Gly 


Lys 


Val 


Gly 


Gin 


Gly 


He 








1060 








1065 








1070 




Ala 


Ala 


His 


Ser 


Lys 


Ser 


Leu 


Asn 


Phe 


Val 


Leu 


Ala 


Ala 


Trp 


He 


Arg 






1075 








1080 








1085 






He 


Leu 


Glu 


Glu 


He 


Leu 


Arg 


Thr 


Gly 


Ser 


Arg 


Thr 


Val 


Arg 


Tyr 


Ser 




1090 








1095 








1100 








Asn 


Gly Leu 


Pro Asp 


Glu 


Glu 


Glu 


Ala 


Met 


Leu 


Leu 


Glu 


Ala 


Lys 


He 
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1105 1110 1115 1120 

Asn Gin Val Pro His Ala Thr Phe Val Ser Ala Asp Trp Thr Glu Phe 

1125 1130 1135 

Asp Thr Ala His Asn Asn Thr Ser Glu Leu Leu Phe Ala Ala Leu Leu 

1140 1145 1150 

Glu Arg lie Gly Thr Pro Ala Ala Ala Val Asn Leu Phe Arg Glu Arg 

1155 1160 1165 

Cys Gly Lys Arg Thr Leu Arg Ala Lys Gly Leu Gly Ser Val Glu Val 

1170 1175 1180 

Asp Gly Leu Leu Asp Ser Gly Ala Ala Trp Thr Pro Cys Arg Asn Thr 
1185 1190 1195 1200 

He Phe Ser Ala Ala Val Met Leu Thr Leu Phe Arg Gly Val Lys Phe 

1205 1210 1215 

Ala Ala Phe Lys Gly Asp Asp Ser Leu Leu Cys Gly Ser His Tyr Leu 

1220 1225 1230 

Arg Phe Asp Ala Ser Arg Leu His Met Gly Glu Arg Tyr Lys Thr Lys 

1235 1240 1245 

His Leu Lys Val Glu Val Gin Lys He Val Pro Tyr He Gly Leu Leu 

• 1250 1255 1260 

Val Ser Ala Glu Gin Val Val Leu Asp Pro Val Arg Ser Ala Leu Lys 
1265 1270 1275 1280 

He Phe Gly Arg Cys Tyr Thr Ser Glu Leu Leu Tyr Ser Lys Tyr Val 

1285 1290 1295 

Glu Ala Val Arg Asp He Thr Lys Gly Trp Ser Asp Ala Arg Tyr His 

1300 1305 1310 

Ser Leu Leu Cys His Met Ser Ala Cys Tyr Tyr Asn Tyr Ala Pro Glu 

1315 1320 1325 

Ser Ala Ala Tyr He lie Asp Ala Val Val Arg Phe Gly Arg Gly Asp 

1330 1335 1340 

Phe Pro Phe Glu Gin Leu Arg Val Val Arg Ala His Val Gin Ala Pro 
1345 1350 1355 1360 

Asp Ala Tyr Ser Ser Thr Tyr Pro Ala Asn Val Arg Ala Ser Cys Leu 

1365 1370 1375 

Asp His Val Phe Glu Pro Arg Gin Ala Ala Ala Pro Ala Gly Phe Val 

1380 1385 1390 

Ala Thr Cys Ala Lys Pro Glu Thr Pro Ser Ser Leu Thr Ala Lys Ala 

1395 1400 1405 

Gly Val Ser Ala Thr Thr Ser His Val Ala Thr Gly Thr Ala Pro Pro 

1410 1415 1420 

Glu Ser Pro Trp Asp Ala Pro Ala Ala Asn Ser Phe Ser Glu Leu Leu 
1425 1430 1435 1440 

Thr Pro Glu Thr Pro Ser Thr Ser Ser Ser Pro Ser Ser Ser Ser Ser 

1445 1450 1455 

Asp Ser Ser Thr Ser Cys Gly Arg Ser Leu Ser Gly Gly Asp Thr Ala 

1460 1465 1470 

Arg Thr Thr Glu Asp Leu Asn Ser Arg Lys Pro Pro Ser Gin Asp Arg 

1475 1480 1485 

Gin Ser Arg Ser Ser Glu Cys Leu Asp Arg Ser Gly Glu Arg Thr Gly 

1490 1495 1500 

Ser Ser Leu Thr Ala Pro Thr Ala Pro Ser Pro Ser Phe Ser Phe Ser 
1505 1510 . 1515 1520 

Glu Arg Ala Arg Leu Ala Thr Gly Pro Thr Val Ala Ala Ala Thr Ser 

1525 1530 1535 

Pro Ser Ala Thr Pro Ser Cys Ala Thr Asp Gin Val Ala Ala Arg Thr 

1540 1545 1550 

Thr Pro Asp Phe Ala Pro Phe Leu Gly Ser Gin Ser Ala Arg Ala Val 
1555 1560 1565 
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Ser Lys Pro Tyr Arg Pro Pro Thr Thr Ala Arg Trp Lys Glu Val Thr 

1570 1575 1580 

Pro Leu His Ala Trp Lys Gly Val Thr Gly Asp Arg Pro Glu Val Arg 
1585 1590 1595 160 

Glu Asp Pro Glu Thr Ala Ala Val Val Gin Ala Leu lie Ser Gly Arg 

1605 1610 1615 

Tyr Pro Gin Lys Thr Lys Leu Ser Ser Asp Ala Ser Lys Gly Tyr Ser 

1620 1625 1630 

Arg Thr Lys Gly Cys Ser Gin Ser Thr Ser Phe Pro Ala Pro Ser Ala 

1635 1640 1645 

Asp Tyr Gin Ala Arg Asp Cys Gin Thr Val Arg Val Cys Arg Ala Ala 

1650 1655 1660 

Ala Glu Met Ala Arg Ser Cys lie His Glu Pro Leu Ala Ser Ser Ala 



Ala Ser Ala Asp Leu Lys Arg lie Arg Ser Thr Ser Asp Ser Val Pro 

1685 1690 1695 

Asp Val Lys lie Ser Lys Ser Ala 
1700 

(2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single. 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 4218.. 4512 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACAATGT ACGCGAAAGC GACAGACGTG 60 

GCGCGTGTCT ACGCCGCGGC AGATGTCGCC TACGCGAACG TACTGCAGCA GAGAGCAGTC 120 

AAGTTGGACT TCGCCCCGCC ACTGAAGGCA CTAGAAACCC TCCACAGACT GTACTATCCG 180 

CTGCGCTTCA AAGGGGGCAC TTTACCCCCG ACACAACACC CGATCCTGGC CGGGCACCAA 24 0 

CGTGTCGCAG AAGAGGTTCT GCACAATTTC GCCAGGGGAC GTAGCACAGT GCTCGAGATA 300 

GGGCCGTCTC TGCACAGCGC ACTTAAGCTA CATGGGGCAC CGAACGCCCC CGTCGCAGAC 360 

TATCACGGGT GCACCAAGTA CGGCACCCGC GACGGCTCGC G AC AC ATT AC GGCCTTAGAG 4 20 

TCTAGATCCG TCGCCACAGG CCGGCCCGAG TTCAAGGCCG ACGCCTCACT GCTCGCCAAC 480 

GGCATTGCCT CCCGCACCTT CTGCGTCGAC GGAGTCGGCT CTTGCGCGTT CAAATCGCGC 54 0 

GTTGGAATTG CCAATCACTC CCTCTATGAC GTGACCCTAG AGGAGCTGGC CAATGCGTTT 600 

GAGAACCACG GACTTCACAT GGTCCGCGCG TTCATGCACA TGCCAGAAGA GCTGCTCTAC 660 

ATGGACAACG TGGTTAATGC CGAGCTCGGC TACCGCTTCC ACGTTATTGA AGAGCCTATG 720 

GCTGTGAAGG ACTGCGCATT CCAGGGGGGG GACCTCCGTC TCCACTTCCC TGAGTTGGAC 780 

TTCATCAACG AGAGCCAAGA GCGGCGCATC GAGAGGCTGG CCGCCCGCGG CTCCTACTCC 84 0 

AGACGCGCCG TCATTTTCTC CGGCGACGAC GACTGGGGTG ATGCGTACTT ACACGACTTC 900 

CACACATGGC TCGCCTACCT ACTGGTGAGG AACTACCCCA CTCCGTTTGG TTTCTCACTC 960 

CATATAGAAG TCCAGAGGCG CCACGGCTCC AG CAT T GAG C TGCGCATCAC TCGCGCGCCA 1020 * 

CCTGGAGACC GCATGCTGGC CGTCGTCCCA AGGACGTCCC AAGGCCTCTG CAGAATCCCA 1080 

AACATCTTTT ATTACGCCGA CGCGTCGGGC ACTGAGCATA AGACCATCCT TACGTCACAG 114 0 

CACAAAGTCA ACATGCTGCT CAATTTTATG CAAACGCGTC CTGAGAAGGA ACTAGTCGAC 1200 

ATGACCGTCT TGATGTCGTT CGCGCGCGCT AGGCTGCGCG CGATCGTGGT CGCCTCAGAA 1260 

GTCACCGAGA GCTCCTGGAA CATCTCACCG GCTGACCTGG TCCGCACTGT CGTGTCTCTT 1320 

TACGTCCTCC ACATCATCGA GCGCCGAAGG GCTGCGGTCG CTGTCAAGAC CGCCAAGGAC 1380 

GACGTCTTTG GAGAGACTTC GTTCTGGGAG AGTCTCAAGC ACGTCTTGGG CTCCTGTTGC 144 0 

GGTCTGCGCA ACCTCAAAGG CACCGACGTC GTCTTTACTA AGCGCGTCGT CGATAAGTAC 1500 
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CGAGTCCACT CGCTCGGAGA CATAATCTGC GACGTCCGCC TGTCCCCTGA ACAGGTCGGC 1560 

TTCCTGCCGT CCCGCGTACC ACCTGCCCGC GTCTTTCACG ACAGGGAAGA GCTTGAGGTC 1620 

CTTCGCGAAG CTGGCTGCTA CAACGAACGT CCGGTACCTT CCACTCCTCC TGTGGAGGAG 1680 

CCCCAAGGTT TCGACGCCGA CTTGTGGCAC GCGACCGCGG CCTCACTCCC CGAGTACCGC 174 0 

GCCACCTTGC AGGCAGGTCT CAACACCGAC GTCAAGCAGC TCAAGATCAC CCTCGAGAAC 1800 

GCCCTCAAGA CCATCGACGG GCTCACCCTC TCCCCAGTCA GAGGCCTCGA GAT GT AC GAG 18 60 

GGCCCGCCAG GCAGCGGCAA GACGGGCACC CTCATCGCCG CCCTTGAGGC CGCGGGCGGT 1920 

AAAGCACTTT ACGTGGCACC CACCAGAGAA CTGAGAGAGG CTATGGACCG GCGGATCAAA 1980 

CCGCCGTCCG CCTCGGCTAC GCAACATGTC GCCCTTGCGA TTCTCCGTCG TGCCACCGCC 204 0 

GAGGGCGCCC CTTTCGCTAC CGTGGTTATC GACGAGTGCT TCATGTTCCC GCTCGTGTAC 2100 

GTCGCGATCG TGCACGCCTT GTCCCCGAGC TCACGAATAG TCCTTGTAGG GGACGTCCAC 2160 

CAAATCGGGT TTATAGACTT CCAAGGCACA AGCGCGAACA TGCCGCTCGT TCGCGACGTC 2220 

GTTAAGCAGT GCCGTCGGCG CACTTTCAAC CAAACCAAGC GCTGTCCGGC CGACGTCGTT 2280 

GCCACCACGT TTTTCCAGAG CTTGTACCCC GGGTGCACAA CCACCTCAGG GTGCGTCGCA 234 0 

TCCATCAGCC ACGTCGCCCC AGACTACCGC AACAGCCAGG CGCAAACGCT CTGCTTCACG 24 00 

CAGGAGGAAA AGTCGCGCCA CGGGGCTGAG GGCGCGATGA CTGTGCACGA AGCGCAGGGA 24 60 

CGCACTTTTG CGTCTGTCAT TCTGCATTAC AACGGCTCCA CAGCAGAGCA GAAGCTCCTC 2520 

GCTGAGAAGT CGCACCTTCT AGTCGGCATC ACGCGCCACA CCAACCACCT GTACATCCGC 2580 

GACCCGACAG GTGACATTGA GAGACAACTC AACCATAGCG CGAAAGCCGA GGTGTTTACA 264 0 

GACATCCCTG CACCCCTGGA GATCACGACT GTCAAACCGA GTGAAGAGGT GCAGCGCAAC 2700 

GAAGTGATGG CAACGATACC CCCGCAGAGT GCCACGCCGC ACGGAGCAAT CCATCTGCTC 27 60 

CGCAAGAACT TCGGGGACCA ACCCGACTGT GGCTGTGTCG CTTTGGCGAA GACCGGCTAC 2820 

GAGGTGTTTG GCGGTCGTGC CAAAATCAAC GTAGAGCTTG CCGAACCCGA CGCGACCCCG 2880 

AAGCCGCATA GGGCGTTCCA GGAAGGGGTA CAGTGGGTCA AGGTCACCAA CGCGTCTAAC 294 0 

AAACACCAGG CGCTCCAGAC GCTGTTGTCC CGCTACACCA AGCGAAGCGC TGACCTGCCG 3000 

CTACACGAAG CTAAGGAGGA CGTCAAACGC ATGCTAAACT CGCTTGACCG ACATTGGGAC 3060 

TGGACTGTCA CTGAAGACGC CCGTGACCGA GCTGTCTTCG AGACCCAGCT CAAGTTCACC 3120 

CAACGCGGCG GCACCGTCGA AGACCTGCTG GAGCCAGACG ACCCCTACAT CCGTGACATA 3180 

GACTTCCTTA TGAAGACTCA GCAGAAAGTG TCGCCCAAGC CGATCAATAC GGGCAAGGTC 324 0 

GGGCAGGGGA TCGCCGCTCA CTCAAAGTCT CTCAACTTCG TCCTCGCCGC TTGGATACGC 3300 

ATACTCGAGG AGATACTCCG TACCGGGAGC CGCACGGTCC GGTACAGCAA CGGTCTCCCC 3360 

GACGAAGAAG AGGCCATGCT GCTCGAAGCG AAGATCAATC AAGTCCCACA CGCCACGTTC 34 20 

GTCTCGGCGG ACTGGACCGA GTTTGACACC GCCCACAATA ACACGAGTGA GCTGCTCTTC 3480 

GCCGCCCTTT TAGAGCGCAT CGGCACGCCT GCAGCTGCCG TTAATCTATT CAGAGAACGG 354 0 

TGTGGGAAAC GCACCTTGCG AGCGAAGGGT CTAGGCTCCG TTGAAGTCGA CGGTCTGCTC 3600 

GACTCCGGCG CAGCTTGGAC GCCTTGCCGC AACACCATCT TCTCTGCCGC CGTCATGCTC 3660 

ACGCTCTTCC GCGGCGTCAA GTTCGCAGCT TTCAAAGGCG ACGACTCGCT CCTCTGTGGT 3720 

AGCCATTACC TCCGTTTCGA CGCTAGCCGC CTTCACATGG GCGAACGTTA CAAGACCAAA 3780 

CATTTGAAGG TCGAGGTGCA GAAAATCGTG CCGTACATCG GACTCCTCGT CTCCGCTGAG 384 0 

CAGGTCGTCC TCGACCCTGT CAGGAGCGCT CTCAAGATAT TTGGGCGCTG CTACACAAGC 3900 

GAACTCCTTT ACTCCAAGTA CGTGGAGGCT GTGAGAGACA TCACCAAGGG CTGGAGTGAC 3960 

GCCCGCTACC ACAGCCTCCT GTGCCACATG TCAGCATGCT ACTACAATTA CGCGCCGGAG 4 020 

TCTGCGGCGT ACATCATCGA CGCTGTTGTT CGCTTTGGGC GCGGCGACTT CCCGTTTGAA 4 080 

CAACTGCGCG TGGTGCGTGC CCATGTGCAG GCACCCGACG CTTACAGCAG CACGTATCCG 414 0 

GCTAACGTGC GCGCATCGTG CCTTGACCAC GTCTTCGAGC CCCGCCAGGC CGCCGCCCCG 4200 

GCAGGTTTCG TTGCGAC ATG TGC GAA GCC GGA AAC GCC TTC TTC ACT TAC 4250 

Met Cys Glu Ala-Gly Asn Ala Phe Phe Thr Tyr 
15 10 

CGC GAA AGC TGG TGT TTC TGC GAC TAC AAG CCA CGT TGC GAC TGG GAC 4298 
Arg Glu Ser Trp Cys Phe Cys Asp Tyr Lys Pro Arg Cys Asp Trp Asp 

15 20 25 

TGC GCC CCC GGA GTC TCC ATG GGA TGC ACC TGC AGC CAA CAG CTT TTC 434 6 
Cys Ala Pro Gly Val Ser Met Gly Cys Thr Cys Ser Gin Gin Leu Phe 

30 35 40 

GGA GTT ATT GAC ACC GGA GAC CCC GTC CAC ATC ATC CTC GCC GTC ATC 4 394 
Gly Val He Asp Thr Gly Asp Pro Val His He He Leu Ala Val He 
45 .50 55 
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GTC TTC ATC GGA CTC CTC TAC ATC GTG TGG AAG GTC GCT CAG TGG TGG 4 4 42 
Val Phe He Gly Leu Leu Tyr He Val Trp Lys Val Ala Gin Trp Trp 
60 65 70 75 

AGA CAC CGC AAG GAC CAC AGA AGA CTT GAA CAG CAG AAA GCC GCC TTC 4 4 90 
Arg His Arg Lys Asp His Arg Arg Leu Glu Gin Gin Lys Ala Ala Phe 

80 85 90 

GCA AGA CAG GCA ATC ACG CTC GTC TGAATGTC TGGACAGAAG CGGAGAAAGG 4 542 
Ala Arg Gin Ala He Thr Leu Val 
• 95 

ACAGGCAGTT CGTTAACTGC CCCCACTGCT CCGAGCCCCT CATTCTCATT TTCGGAAAGA 4 602 

GCTCGACTGG CGACCGGGCC GACTGTCGCC GCTGCGACAT CACCTTCGGC AACCCCATCC 4 662 

TGCGCCACGG ACCAGGTTGC CGCGAGGACC ACGCCGGACT TTGCGCCTTT CCTGGGTTCC 4 722 

CAGTCTGCCC GTGCTGTCTC GAAGCCGTAC CGGCCCCCCA CGACTGCCCG TTGGAAAGAA 4782 

GTCACCCCGC TCCACGCGTG GAAGGGCGTG ACCGGAGACC GACCGGAAGT CAGGGAGGAC 4 842 

CCGGAGACAG CGGCGGTCGT CCAGGCTCTG ATCAGCGGCC GTTATCCTCA GAAGACGAAG 4 902 

CTTTCCTCCG ACGCATCCAA AGGCTACTCA AGAACTAAGG GAT GCT CAC A ATCCACCTCT 4 962 

TTTCCTGCCC CGAGTGCGGA TTACCAGGCC CGCGACTGCC AGACAGTCCG AGTCTGCCGC 5022 

GCCGCTGCAG AGATGGCGCG CTCATGTATT CACGAGCCGT TGGCTTCATC TGCCGCCAGT 5082 

GCCGACTTGA AGCGCATACG CTCTACCTCG GACTCTGTTC CCGATGTAAA GATCAGCAAG 514 2 

AGCGCATGAA GGAACAAAAT TAGTTTCCTT GTTCGTAAAC AAGGTGGTCC CTCCCATTGA 5202 

GGTAAAGACT CTGGTGAGTC CTCAACGTTA CTCGTTGAGT CTGCTGCGGT TCGATTCCAT 5262 

TCCCAAGCAG CAAAGGGTGC GCAACTAGTA CGGCGCCCCC TGGGATACCA 5312 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



Met 


Cys 


Glu 


Ala 


Gly 


Asn 


Ala 


Phe 


Phe 


Thr 


Tyr 


Arg 


Glu 


Ser 


Trp 


Cys 


1 








5 










10 










15 




Phe 


Cys 


Asp 


Tyr 
20 


Lys 


Pro 


Arg 


Cys" 


Asp 
25 


Trp 


Asp 


Cys 


Ala 


Pro 
30 


Gly 


Val 


Ser 


Met 


Gly 


Cys 


Thr 


Cys 


Ser 


Gin 


Gin 


Leu 


Phe 


Gly Val 


He 


Asp 


Thr 






35 










40 










45 








Gly 


Asp 
50 


Pro 


Val 


His 


He 


He 
55 


Leu 


Ala 


Val 


He 


Val 
60 


Phe 


He 


Gly 


Leu 


Leu 


Tyr 


He 


Val 


Trp 


Lys 


Val 


Ala 


Gin 


Trp 


Trp 


Arg 


His 


Arg 


Lys 


Asp 


65 










70 










75 










80 


His 


Arg 


Arg 


Leu 


Glu 
85 


Gin 


Gin 


Lys 


Ala 


Ala 
90 


Phe 


Ala 


Arg 


Gin 


Ala 
95 


He 



Thr Leu Val 



(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 4518.. 4937 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
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GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACAATGT ACGCGAAAGC GACAGACGTG 60 

GCGCGTGTCT ACGCCGCGGC AGATGTCGCC TACGCGAACG TACTGCAGCA GAGAGCAGTC 120 

AAGTTGGACT TCGCCCCGCC ACTGAAGGCA CTAGAAACCC TCCACAGACT GTACTATCCG 180 

CTGCGCTTCA AAGGGGGCAC TTTACCCCCG ACACAACACC CGATCCTGGC CGGGCACCAA 240 

CGTGTCGCAG AAGAGGTTCT GCACAATTTC GCCAGGGGAC GTAGCACAGT GCTCGAGATA 300 

GGGCCGTCTC TGCACAGCGC ACTTAAGCTA CATGGGGCAC CGAACGCCCC CGTCGCAGAC 360 

TATCACGGGT GCACCAAGTA CGGCACCCGC GACGGCTCGC GACACATTAC GGCCTTAGAG 420 

TCTAGATCCG TCGCCACAGG CCGGCCCGAG TTCAAGGCCG ACGCCTCACT GCTCGCCAAC 4 80 

GGCATTGCCT CCCGCACCTT CTGCGTCGAC GGAGTCGGCT CTTGCGCGTT CAAATCGCGC 54 0 

GTTGGAATTG CCAATCACTC CCTCTATGAC GTGACCCTAG AGGAGCTGGC CAATGCGTTT 600 

GAGAACCACG GACTTCACAT GGTCCGCGCG TTCATGCACA TGCCAGAAGA GCTGCTCTAC 660 

ATGGACAACG TGGTTAATGC CGAGCTCGGC TACCGCTTCC ACGTTATTGA AGAGCCTATG 720 

GCTGTGAAGG ACTGCGCATT CCAGGGGGGG GACCTCCGTC TCCACTTCCC TGAGTTGGAC 78.0 

TTCATCAACG AGAGCCAAGA GCGGCGCATC GAGAGGCTGG CCGCCCGCGG CTCCTACTCC 84 0 

AGACGCGCCG TCATTTTCTC CGGCGACGAC GACTGGGGTG ATGCGTACTT ACACGACTTC 900 

CACACATGGC TCGCCTACCT ACTGGTGAGG AACTACCCCA CTCCGTTTGG TTTCTCACTC 960 

CATATAGAAG TCCAGAGGCG CCACGGCTCC AGCATTGAGC TGCGCATCAC TCGCGCGCCA 1020 

CCTGGAGACC GCATGCTGGC CGTCGTCCCA AGGACGTCCC AAGGCCTCTG CAGAATCCCA 1080 

AACATCTTTT ATTACGCCGA CGCGTCGGGC ACTGAGCATA AGACCATCCT TACGTCACAG 1140 

CACAAAGTCA ACATGCTGCT CAATTTTATG CAAACGCGTC CTGAGAAGGA ACTAGTCGAC 1200 

ATGACCGTCT TGATGTCGTT CGCGCGCGCT AGGCTGCGCG CGATCGTGGT CGCCTCAGAA 1260 

GTCACCGAGA GCTCCTGGAA CATCTCACCG GCTGACCTGG TCCGCACTGT CGTGTCTCTT 1320 

TACGTCCTCC ACATCATCGA GCGCCGAAGG GCTGCGGTCG CTGTCAAGAC CGCCAAGGAC 1380 

GACGTCTTTG GAGAGACTTC GTTCTGGGAG AGTCTCAAGC ACGTCTTGGG CTCCTGTTGC 14 40 

GGTCTGCGCA ACCTCAAAGG CACCGACGTC GTCTTTACTA AGCGCGTCGT CGATAAGTAC 1500 

CGAGTCCACT CGCTCGGAGA CATAATCTGC GACGTCCGCC TGTCCCCTGA ACAGGTCGGC 1560 

TTCCTGCCGT CCCGCGTACC ACCTGCCCGC GTCTTTCACG ACAGGGAAGA GCTTGAGGTC 1620 

CTTCGCGAAG CTGGCTGCTA CAACGAACGT CCGGTACCTT CCACTCCTCC TGTGGAGGAG 1680 

CCCCAAGGTT TCGACGCCGA CTTGTGGCAC GCGACCGCGG CCTCACTCCC CGAGTACCGC 17 40 

GCCACCTTGC AGGCAGGTCT CAACACCGAC GTCAAGCAGC TCAAGATCAC CCTCGAGAAC 1800 

GCCCTCAAGA CCATCGACGG GCTCACCCTC TCCCCAGTCA GAGGCCTCGA GAT GT AC GAG 1860 

GGCCCGCCAG GCAGCGGCAA GACGGGCACC CTCATCGCCG CCCTTGAGGC CGCGGGCGGT 1920 

AAAGCACTTT ACGTGGCACC CACCAGAGAA CTGAGAGAGG CTATGGACCG GCGGATCAAA 1980 

CCGCCGTCCG CCTCGGCTAC GCAACATGTC GCCCTTGCGA TTCTCCGTCG TGCCACCGCC 204 0 

GAGGGCGCCC CTTTCGCTAC CGTGGTTATC GACGAGTGCT TCATGTTCCC GCTCGTGTAC 2100 

GTCGCGATCG TGCACGCCTT GTCCCCGAGC TCACGAATAG TCCTTGTAGG GGACGTCCAC 2160 

CAAATCGGGT TTATAGACTT CCAAGGCACA AGCGCGAACA TGCCGCTCGT TCGCGACGTC 2220 

GTTAAGCAGT GCCGTCGGCG CACTTTCAAC CAAACCAAGC GCTGTCCGGC CGACGTCGTT 2280 

GCCACCACGT TTTTCCAGAG CTTGTACCCC GGGTGCACAA CCACCTCAGG GTGCGTCGCA 234 0 

TCCATCAGCC ACGTCGCCCC AGACTACCGC AACAGCCAGG CGCAAACGCT CTGCTTCACG 24 00 

CAGGAGGAAA AGTCGCGCCA CGGGGCTGAG GGCGCGATGA CTGTGCACGA AGCGCAGGGA 24 60 

CGCACTTTTG CGTCTGTCAT TCTGCATTAC AACGGCTCCA CAGCAGAGCA GAAGCTCCTC 2520 

GCTGAGAAGT CGCACCTTCT AGTCGGCATC ACGCGCCACA CCAACCACCT GTACATCCGC 2580 

GACCCGACAG GTGACATTGA GAGACAACTC AACCATAGCG CGAAAGCCGA GGTGTTTACA 264 0 

GACATCCCTG CACCCCTGGA GATCACGACT GTCAAACCGA GTGAAGAGGT GCAGCGCAAC 2700 

GAAGTGATGG CAACGATACC CCCGCAGAGT GCCACGCCGC ACGGAGCAAT CCATCTGCTC 27 60 

CGCAAGAACT TCGGGGACCA ACCCGACTGT GGCTGTGTCG CTTTGGCGAA GACCGGCTAC 2820 

GAGGTGTTTG GCGGTCGTGC CAAAATCAAC GTAGAGCTTG CCGAACCCGA CGCGACCCCG 2880 

AAGCCGCATA GGGCGTTCCA GGAAGGGGTA CAGTGGGTCA AGGTCACCAA CGCGTCTAAC 294 0 

AAACACCAGG CGCTCCAGAC GCTGTTGTGC CGCTACACCA AGCGAAGCGC TGACCTGCCG 3000 

CTACACGAAG CTAAGGAGGA CGTCAAACGC ATGCTAAACT CGCTTGACCG ACATTGGGAC 3060 

TGGACTGTCA CTGAAGACGC CCGTGACCGA GCTGTCTTCG AGACCCAGCT CAAGTTCACC 3120 

CAACGCGGCG GCACCGTCGA AGACCTGCTG GAGCCAGACG ACCCCTACAT CCGTGACATA 3180 

GACTTCCTTA TGAAGACTCA GCAGAAAGTG TCGCCCAAGC CGATCAATAC GGGCAAGGTC 3240 

GGGCAGGGGA TCGCCGCTCA CTCAAAGTCT CTCAACTTCG TCCTCGCCGC TTGGATACGC 3300 

ATACTCGAGG AGATACTCCG TACCGGGAGC CGCACGGTCC GGTACAGCAA CGGTCTCCCC 3360 
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GACGAAGAAG AGGCCATGCT GCTCGAAGCG AAGATCAATC AAGTCCCACA CGCCACGTTC 3420 

GTCTCGGCGG ACTGGACCGA GTTTGACACC GCCCACAATA ACACGAGTGA GCTGCTCTTC 34 80 

GCCGCCCTTT TAGAGCGCAT CGGCACGCCT GCAGCTGCCG TTAATCTATT CAGAGAACGG 354 0 

TGTGGGAAAC GCACCTTGCG AGCGAAGGGT CTAGGCTCCG TTGAAGTCGA CGGTCTGCTC 3600 

GACTCCGGCG CAGCTTGGAC GCCTTGCCGC AACACCATCT TCTCTGCCGC CGTCATGCTC 3660 

ACGCTCTTCC GCGGCGTCAA GTTCGCAGCT TTCAAAGGCG ACGACTCGCT CCTCTGTGGT 3720 

AGCCATTACC TCCGTTTCGA CGCTAGCCGC CTTCACATGG GCGAACGTTA CAAGACCAAA 3780 

CATTTGAAGG TCGAGGTGCA GAAAATCGTG CCGTACATCG GACTCCTCGT CTCCGCTGAG 384 0 

CAGGTCGTCC TCGACCCTGT CAGGAGCGCT CTCAAGATAT TTGGGCGCTG CTACACAAGC 3900 

GAACTCCTTT ACTCCAAGTA CGTGGAGGCT GT GAG AG AC A TCACCAAGGG CTGGAGTGAC 3960 

GCCCGCTACC ACAGCCTCCT GTGCCACATG TCAGCATGCT ACTACAATTA CGCGCCGGAG 4 020 

TCTGCGGCGT ACATCATCGA CGCTGTTGTT CGCTTTGGGC GCGGCGACTT CCCGTTTGAA 4 080 

CAACTGCGCG TGGTGCGTGC CCATGTGCAG GCACCCGACG CTTACAGCAG CACGTATCCG 414 0 

GCTAACGTGC GCGCATCGTG CCTTGACCAC GTCTTCGAGC CCCGCCAGGC CGCCGCCCCG 4 200 

GCAGGTTTCG TTGCGACATG TGCGAAGCCG GAAACGCCTT CTTCACTTAC CGCGAAAGCT 4 260 

GGTGTTTCTG CGACTACAAG CCACGTTGCG ACTGGGACTG CGCCCCCGGA GTCTCCATGG 4 320 

GATGCACCTG CAGCCAACAG CTTTTCGGAG TTATTGACAC CGGAGACCCC GTCCACATCA 4 380 

TCCTCGCCGT CATCGTCTTC ATCGGACTCC TCTACATCGT GTGGAAGGTC GCTCAGTGGT 4 44 0 

GGAGACACCG CAAGGACCAC AGAAGACTTG AACAGCAGAA AGCCGCCTTC GCAAGACAGG 4 500 

CAATCACGCT CGTCTGA ATG TCT GGA CAG AAG CGG AGA AAG GAC AGG CAG 4550 

Met Ser Gly Gin Lys Arg Arg Lys Asp Arg Gin 
1 5 10 

TTC GTT AAC TGC CCC CAC TGC TCC GAG CCC CTC ATT CTC ATT TTC GGA 4 598 
Phe Val Asn Cys Pro His Cys Ser Glu Pro Leu lie Leu lie Phe Gly 

15 20 25 

AAG AGC TCG ACT GGC GAC CGG GCC GAC TGT CGC CGC TGC GAC ATC ACC 4 64 6 
Lys Ser Ser Thr Gly Asp Arg Ala Asp Cys Arg Arg Cys Asp lie Thr 

30 35 40 

TTC GGC AAC CCC ATC CTG CGC CAC GGA CCA GGT TGC CGC GAG GAC CAC 4 694 
Phe Gly Asn Pro lie Leu Arg His Gly Pro Gly Cys Arg Glu Asp His 

45 50 55 

GCC GGA CTT TGC GCC TTT CCT GGG TTC CCA GTC TGC CCG TGC TGT CTC 4 742 
Ala Gly Leu Cys Ala Phe Pro Gly Phe Pro Val Cys Pro Cys Cys Leu 
60 65 70 75 

GAA GCC GTA CCG GCC CCC CAC GAC TGC CCG TTG GAA AGA AGT CAC CCC 4 790 
Glu Ala Val Pro Ala Pro His Asp Cys Pro Leu Glu Arg Ser His Pro 

80 85 90 

GCT CCA CGC GTG GAA GGG CGT GAC CGG AGA CCG ACC GGA AGT CAG GGA 4 838 
Ala Pro Arg Val Glu Gly Arg Asp Arg Arg Pro Thr Gly Ser Gin Gly 

95 100 105 

GGA CCC GGA GAC AGC GGC GGT CGT CCA GGC TCT GAT CAG CGG CCG TTA 4 88 6 
Gly Pro Gly Asp Ser Gly Gly Arg Pro Gly Ser Asp Gin Arg Pro Leu 

110 115 120 

TCC TCA GAA GAC GAA GCT TTC CTC CGA CGC ATC CAA AGG CTA CTC AAG 4 934 
Ser Ser Glu Asp Glu Ala Phe Leu Arg Arg lie Gin Arg Leu Leu Lys 

125 130 135 ■ 

AAC TAAGGGATGC TCACAATCCA CCTCTTTTCC TGCCCCGAGT GCGGATTACC 4 987 
Asn 
140 

AGGCCCGCGA CTGCCAGACA GTCCGAGTCT GCCGCGCCGC TGCAGAGATG GCGCGCTCAT 5047 

GTATTCACGA GCCGTTGGCT TCATCTGCCG CCAGTGCCGA CTTGAAGCGC ATACGCTCTA 5107 

CCTCGGACTC TGTTCCCGAT GTAAAGATCA GCAAGAGCGC ATGAAGGAAC AAAATTAGTT 5167 

TCCTTGTTCG TAAACAAGGT GGTCCCTCCC ATTGAGGTAA AGACTCTGGT GAGTCCTCAA 5227 

CGTTACTCGT TGAGTCTGCT GCGGTTCGAT TCCATTCCCA AGCAGCAAAG GGTGCGCAAC 5287 

TAGTACGGCG CCCCCTGGGA TACCA 5312 

(2) INFORMATION FOR SEQ ID NO: 44: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



Met 


Ser 


Gly 


Gin 


Lys 


Arg 


Arg 


Lys 


Asp Arg 


Gin 


Phe 


Val 


Asn 


Cys 


Pro 


1 








5 










10 










15 




His 


Cys 


Ser 


Glu 
20 


Pro 


Leu 


He 


Leu 


He 
25 


Phe 


Gly 


Lys 


Ser 


Ser 
30 


Thr 


Gly 


Asp 


Arg 


Ala 
35 


Asp 


Cys 


Arg 


Arg 


Cys 
40 


Asp 


He 


Thr 


Phe 


Gly 
45 


Asn 


Pro 


He 


Leu 


Arg 
50 


His 


Gly 


Pro 


Gly 


Cys 
55 


Arg 


Glu 


Asp 


His 


Ala 
60 


Gly 


Leu 


Cys 


Ala 


Phe 


Pro 


Gly 


Phe 


Pro 


Val 


Cys 


Pro 


Cys 


Cys 


Leu 


Glu 


Ala 


Val 


Pro 


Ala 


65 










70 










75 










80 


Pro 


His 


Asp 


Cys 


Pro 
85 


Leu 


Glu 


Arg 


Ser 


His 
90 


Pro 


Ala 


Pro 


Arg 


Val 
95 


Glu 


Gly 


Arg 


Asp 


Arg 
100 


Arg 


Pro 


Thr 


Gly 


Ser 
105 


Gin 


Gly 


Gly 


Pro 


Gly 
110 


Asp 


Ser 


Gly 


Gly 


Arg 
115 


Pro 


Gly 


Ser 


Asp 


Gin 
120 


Arg 


Pro 


Leu 


Ser 


Ser 
125 


Glu 


Asp 


Glu 


Ala 


Phe 
130 


Leu 


Arg 


Arg 


He 


Gin 
135 


Arg 


Leu 


Leu 


Lys 


Asn 
140 











(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 4944.. 5162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACAATGT ACGCGAAAGC GACAGACGTG 60 

GCGCGTGTCT ACGCCGCGGC AGATGTCGCC TACGCGAACG TACTGCAGCA GAGAGCAGTC 120 

AAGTTGGACT TCGCCCCGCC ACTGAAGGCA CTAGAAACCC TCCACAGACT GTACTATCCG 180 

CTGCGCTTCA AAGGGGGCAC TTTACCCCCG ACACAACACC CGATCCTGGC CGGGCACCAA 240 

CGTGTCGCAG AAGAGGTTCT GCACAATTTC GCCAGGGGAC GTAGCACAGT GCTCGAGATA 300 

GGGCCGTCTC TGCACAGCGC ACTTAAGCTA CATGGGGCAC CGAACGCCCC CGTCGCAGAC 360 

TATCACGGGT GCACCAAGTA CGGCACCCGC GACGGCTCGC G AC AC ATT AC GGCCTTAGAG 4 20 

TCTAGATCCG TCGCCACAGG CCGGCCCGAG TTCAAGGCCG ACGCCTCACT GCTCGCCAAC 4 80 

GGCATTGCCT CCCGCACCTT CTGCGTCGAC GGAGTCGGCT CTTGCGCGTT CAAATCGCGC 54 0 

GTTGGAATTG CCAATCACTC CCTCTATGAC GTGACCCTAG AGGAGCTGGC CAATGCGTTT 600 

GAGAACCACG GACTTCACAT GGTCCGCGCG TTCATGCACA TGCCAGAAGA GCTGCTCTAC 660 

ATGGACAACG TGGTTAATGC CGAGCTCGGC TACCGCTTCC ACGTTATTGA AGAGCCTATG 720 

GCTGTGAAGG ACTGCGCATT CCAGGGGGGG GACCTCCGTC TCCACTTCCC TGAGTTGGAC 780 

TTCATCAACG AGAGCCAAGA GCGGCGCATC GAGAGGCTGG CCGCCCGCGG CTCCTACTCC 8 40 

AGACGCGCCG TCATTTTCTC CGGCGACGAC GACTGGGGTG ATGCGTACTT ACACGACTTC 900 

CACACATGGC TCGCCTACCT ACTGGTGAGG AACTACCCCA CTCCGTTTGG TTTCTCACTC 960 

CATATAGAAG TCCAGAGGCG CCACGGCTCC AGCATTGAGC TGCGCATCAC TCGCGCGCCA 1020 

CCTGGAGACC GCATGCTGGC CGTCGTCCCA AGGACGTCCC AAGGCCTCTG CAGAATCCCA 1080 

AACATCTTTT ATTACGCCGA CGCGTCGGGC ACTGAGCATA AGACCATCCT TACGTCACAG 114 0 
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CACAAAGTCA ACATGCTGCT CAATTTTATG CAAACGCGTC CTGAGAAGGA ACTAGTCGAC 1200 

ATGACCGTCT TGATGTCGTT CGCGGGCGCT AGGCTGCGCG CGATCGTGGT CGCCTCAGAA 1260 

GTCACCGAGA GCTCCTGGAA CATCTCACCG GCTGACCTGG TCCGCACTGT CGTGTCTCTT 1320 

TACGTCCTCC ACATCATCGA GCGCCGAAGG GCTGCGGTCG CTGTCAAGAC CGCCAAGGAC 1380 

GACGTCTTTG GAGAGACTTC GTTCTGGGAG AGTCTCAAGC ACGTCTTGGG CTCCTGTTGC 14 4 0 

GGTCTGCGCA ACCTCAAAGG CACCGACGTC GTCTTTACTA AGCGCGTCGT CGATAAGTAC 1500 

CGAGTCCACT CGCTCGGAGA CATAATCTGC GACGTCCGCC TGTCCCCTGA ACAGGTCGGC 1560 

TTCCTGCCGT CCCGCGTACC ACCTGCCCGC GTCTTTCACG ACAGGGAAGA GCTTGAGGTC 1620 

CTTCGCGAAG CTGGCTGCTA CAACGAACGT CCGGTACCTT CCACTCCTCC TGTGGAGGAG 1680 

CCCCAAGGTT TCGACGCCGA CTTGTGGCAC GCGACCGCGG CCTCACTCCC CGAGTACCGC 17 4 0 

GCCACCTTGC AGGCAGGTCT CAACACCGAC GTCAAGCAGC TCAAGATCAC CCTCGAGAAC 1800 

GCCCTCAAGA CCATCGACGG GCTCACCCTC TCCCCAGTCA GAGGCCTCGA GAT GT AC GAG 18 60 

GGCCCGCCAG GCAGCGGCAA GACGGGCACC CTCATCGCCG CCCTTGAGGC CGCGGGCGGT 1920 

AAAGCACTTT ACGTGGCACC CACCAGAGAA CTGAGAGAGG CTATGGACCG GCGGATCAAA 1980 

CCGCCGTCCG CCTCGGCTAC GCAACATGTC GCCCTTGCGA TTCTCCGTCG TGCCACCGCC 204 0 

GAGGGCGCCC CTTTCGCTAC CGTGGTTATC GACGAGTGCT TCATGTTCCC GCTCGTGTAC .2100 

GTCGCGATCG TGCACGCCTT GTCCCCGAGC TCACGAATAG TCCTTGTAGG GGACGTCCAC 2160 

CAAATCGGGT TTATAGACTT CCAAGGCACA AGCGCGAACA TGCCGCTCGT TCGCGACGTC 2220 

GTTAAGCAGT GCCGTCGGCG CACTTTCAAC- CAAACCAAGC GCTGTCCGGC CGACGTCGTT 2280 

GCCACCACGT TTTTCCAGAG CTTGTACCCC GGGTGCACAA CCACCTCAGG GTGCGTCGCA 2340 

TCCATCAGCC ACGTCGCCCC AGACTACCGC AACAGCCAGG CGCAAACGCT CTGCTTCACG 24 00 

CAGGAGGAAA AGTCGCGCCA CGGGGCTGAG GGCGCGATGA CTGTGCACGA AGCGCAGGGA 24 60 

CGCACTTTTG CGTCTGTCAT TCTGCATTAC AACGGCTCCA CAGCAGAGCA GAAGCTCCTC 2520 

GCTGAGAAGT CGCACCTTCT AGTCGGCATC ACGCGCCACA CCAACCACCT GTACATCCGC 2580 

GACCCGACAG GTGACATTGA GAGACAACTC AACCATAGCG CGAAAGCCGA GGTGTTTACA 2 64 0 

GACATCCCTG CACCCCTGGA GATCACGACT GTCAAACCGA GTGAAGAGGT GCAGCGCAAC 27 00 

GAAGTGATGG CAACGATACC CCCGCAGAGT GCCACGCCGC ACGGAGCAAT CCATCTGCTC 27 60 

CGCAAGAACT TCGGGGACCA ACCCGACTGT GGCTGTGTCG CTTTGGCGAA GACCGGCTAC 2820 

GAGGTGTTTG GCGGTCGTGC CAAAATCAAC GTAGAGCTTG CCGAACCCGA CGCGACCCCG 2880 

AAGCCGCATA GGGCGTTCCA GGAAGGGGTA CAGTGGGTCA AGGTCACCAA CGCGTCTAAC 2 94 0 

AAACACCAGG CGCTCCAGAC GCTGTTGTCC CGCTACACCA AGCGAAGCGC TGACCTGCCG 3000 

CTACACGAAG CTAAGGAGGA CGTCAAACGC ATGCTAAACT CGCTTGACCG ACATTGGGAC 3060 

TGGACTGTCA CTGAAGACGC CCGTGACCGA GCTGTCTTCG AGACCCAGCT CAAGTTCACC 3120 

CAACGCGGCG GCACCGTCGA AGACCTGCTG GAGCCAGACG ACCCCTACAT CCGTGACATA 3180 

GACTTCCTTA TGAAGACTCA GCAGAAAGTG TCGCCCAAGC CGATCAATAC GGGCAAGGTC 3240 

GGGCAGGGGA TCGCCGCTCA CTCAAAGTCT CTCAACTTCG TCCTCGCCGC TTGGATACGC 3300 

ATACTCGAGG AGATACTCCG TACCGGGAGC CGCACGGTCC GGTACAGCAA CGGTCTCCCC 3360 

GACGAAGAAG AGGCCATGCT GCTCGAAGCG AAGATCAATC AAGTCCCACA CGCCACGTTC 34 20 

GTCTCGGCGG ACTGGACCGA GTTTGACACC GCCCACAATA ACACGAGTGA GCTGCTCTTC 34 80 

GCCGCCCTTT TAGAGCGCAT CGGCACGCCT GCAGCTGCCG TTAATCTATT CAGAGAACGG 35 4 0 

TGTGGGAAAC GCACCTTGCG AGCGAAGGGT CTAGGCTCCG TTGAAGTCGA CGGTCTGCTC 3600 

GACTCCGGCG CAGCTTGGAC GCCTTGCCGC AACACCATCT TCTCTGCCGC CGTCATGCTC 3660 

ACGCTCTTCC GCGGCGTCAA GTTCGCAGCT TTCAAAGGCG ACGACTCGCT C.CTCTGTGGT 3720 

AGCCATTACC TCCGTTTCGA CGCTAGCCGC CTTCACATGG GCGAACGTTA CAAGACCAAA 37 80 

CATTTGAAGG TCGAGGTGCA GAAAATCGTG CCGTACATCG GACTCCTCGT CTCCGCTGAG 384 0 

CAGGTCGTCC TCGACCCTGT CAGGAGCGCT CTCAAGATAT TTGGGCGCTG CTACACAAGC 3900 

GAACTCCTTT ACTCCAAGTA CGTGGAGGCT GTGAGAGACA TCACCAAGGG CTGGAGTGAC 3960 

GCCCGCTACC ACAGCCTCCT GTGCCACATG TCAGCATGCT ACTACAATTA CGCGCCGGAG 4 020 

TCTGCGGCGT ACATCATCGA CGCTGTTGTT CGCTTTGGGC GCGGCGACTT CCCGTTTGAA 4 080 

CAACTGCGCG TGGTGCGTGC CCATGTGCAG GCACCCGACG CTTACAGCAG CACGTATCCG 414 0 

GCTAACGTGC GCGCATCGTG CCTTGACCAC GTCTTCGAGC CCCGCCAGGC CGCCGCCCCG 4 200 

GCAGGTTTCG TTGCGACATG TGCGAAGCCG GAAACGCCTT CTTCACTTAC CGCGAAAGCT 42 60 

GGTGTTTCTG CGACTACAAG CCACGTTGCG ACTGGGACTG CGCCCCCGGA GTCTCCATGG 4 320 

GATGCACCTG CAGCCAACAG CTTTTCGGAG TTATTGACAC CGGAGACCCC GTCCACATCA 4 380 

TCCTCGCCGT CATCGTCTTC ATCGGACTCC TCTACATCGT GTGGAAGGTC GCTCAGTGGT 4 4 40 

GGAGACACCG CAAGGACCAC AGAAGACTTG AACAGCAGAA AGCCGCCTTC GCAAGACAGG 4 500 

CAATCACGCT CGTCTGAATG TCTGGACAGA AGCGGAGAAA GGACAGGCAG TTCGTTAACT 4 5 60 
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GCCCCCACTG CTCCGAGCCC CTCATTCTCA TTTTCGGAAA GAGCTCGACT GGCGACCGGG 4 620 

CCGACTGTCG CCGCTGCGAC ATCACCTTCG GCAACCCCAT CCTGCGCCAC GGACCAGGTT 4 680 

GCCGCGAGGA CCACGCCGGA CTTTGCGCCT TTCCTGGGTT CCCAGTCTGC CCGTGCTGTC 4 74 0 

TCGAAGCCGT ACCGGCCCCC CACGACTGCC CGTTGGAAAG AAGTCACCCC GCTCCACGCG 4 800 

TGGAAGGGCG TGACCGGAGA CCGACCGGAA GTCAGGGAGG ACCCGGAGAC AGCGGCGGTC 4 8 60 

GTCCAGGCTC TGATCAGCGG CCGTTATCCT CAGAAGACGA AGCTTTCCTC CGACGCATCC 4 920 

AAAGGCTACT CAAGAACTAA GGG ATG CTC ACA ATC CAC CTC TTT TCC TGC 4 97 0 

Met Leu Thr lie His Leu Phe Ser Cys 
1 5 

CCC GAG TGC GGA TTA CCA GGC CCG CGA CTG CCA GAC AGT CCG AGT CTG 5018 
Pro Glu Cys Gly Leu Pro Gly Pro Arg Leu Pro Asp Ser Pro Ser Leu 
10 15 20 25 

CCG CGC CGC TGC AGA GAT GGC GCG CTC ATG TAT TCA CGA GCC GTT GGC 5066 
Pro Arg Arg Cys Arg Asp Gly Ala Leu Met Tyr Ser Arg Ala Val Gly 

30 35 40 

TTC ATC TGC CGC CAG TGC CGA CTT GAA GCG CAT ACG CTC TAC CTC GGA 5114 
Phe lie Cys Arg Gin Cys Arg Leu Glu Ala His Thr Leu Tyr Leu Gly 

45 50 55 

CTC TGT TCC CGA TGT AAA GAT CAG CAA GAG CGC ATG AAG GAA CAA AAT 5162 
Leu Cys Ser Arg Cys Lys Asp Gin Gin Glu Arg Met Lys Glu Gin Asn 

60 65 70 

TAGTTTCCTT GTTCGTAAAC AAGGTGGTCC CTCCCATTGA GGTAAAGACT CTGGTGAGTC 5222 

CTCAACGTTA CTCGTTGAGT CTGCTGCGGT TCGATTCCAT TCCCAAGCAG CAAAGGGTGC 5282 

GCAACTAGTA CGGCGCCCCC T GG GAT AC C A 5312 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 



Met 


Leu 


Thr 


He 


His 


Leu 


Phe 


Ser 


Cys 


Pro 


Glu 


Cys 


Gly 


Leu 


Pro 


Gly 


1 








5 










10 










15 




Pro 


Arg 


Leu 


Pro 


Asp 


Ser 


Pro 


Ser 


Leu 


Pro 


Arg 


Arg 


Cys 


Arg 


Asp 


Gly 








20 










25 










30 






Ala 


Leu 


Met 


Tyr 


Ser 


Arg 


Ala 


Val 


Gly 


Phe 


He 


Cys 


Arg 


Gin 


Cys 


Arg 






35 










40 










45 








Leu 


Glu 


Ala 


His 


Thr 


Leu 


Tyr 


Leu 


Gly 


Leu 


Cys 


Ser 


Arg 


Cys 


Lys 


Asp 




50 










55 










60 










Gin 


Gin 


Glu 


Arg 


Met 


Lys 


Glu 


Gin 


Asn 

















65 70 

(2) INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 283. .753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

GTTTTTCTTT CTTTACCAAG TGTGGTAAAA TTTAAACAAA GAAGAAAACC AGGACCGTAA 60 
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CCCGGCCCTT ACACACCTCG AGTCCGTGAC CACCGGATTA TACGTCGCCC ACCACACGGC 120 

GCCTTTTCCG ACCACTCTCG AGAGTCGTTG GGAGTTTCGT CCGTGACCAC CCGGTTGGCA 180 

GTCGACAGAC GCTTCCGGAC CACTAGAACC TCCTCGAGCG ACGCACACAC AGCACACACA 240 

CCGCCTTAGC TGCACCTACG GCAGCGTTGA TAGCGCGGAT TT ATG AGC GAG CAC 294 

Met Ser Glu His 
1 

ACC ATC GCC CAC TCC ATC ACA TTA CCA CCC GGT TAC ACC CTT GCC CTA 342 
Thr lie Ala His Ser lie Thr Leu Pro Pro Gly Tyr Thr Leu Ala Leu 
5 10 15 20 

ATA CCC CCT GAA CCT GAA GCA GGA TGG GAG ATG CTG GAG TGG CGT CAC 390 
lie Pro Pro Glu Pro Glu Ala Gly Trp Glu Met Leu Glu Trp Arg His 

25 30 35 

AGC GAC CTC ACA ACC GTC GCG GAA CCC GTA ACG TTC GGG TCA GCG CCA 438 
Ser Asp Leu Thr Thr Val Ala Glu Pro Val Thr Phe Gly Ser Ala Pro 

40 45 50 

ACA CCG TCA CCG TCA ATG GTA GAA GAA ACC AAC GGC GTC GGA CCG GAA 48 6 
Thr Pro Ser Pro Ser Met Val Glu Glu Thr Asn Gly Val Gly Pro Glu 

55 60 65 

GGC AAG TTT CTC CCC CTG ACA ATT TCA CCG CTG CTG CAC AAG ACC TCG 534 
Gly Lys Phe Leu Pro Leu Thr lie Ser Pro Leu Leu His Lys Thr Ser 

70 75 80 

CGC AAA GCC TTG ACG CCA ACA CCG TCA CTT TCC CCG CTA ACA TCT CTA 582 
Arg Lys Ala Leu Thr Pro Thr Pro Ser Leu Ser Pro Leu Thr Ser Leu 
85 90 95 100 

GCA TGC CCG AAT TCC. GGA ATT GGG CCA AGG GAA AGA TCG ACC TCG ACT 630 
Ala Cys Pro Asn Ser Gly lie Gly Pro Arg Glu Arg Ser Thr Ser Thr 

105 110 115 

CCG ATT CCA TCG GCT GGT ACT TCA AGT ACC TTG ACC CAG CGG GTG CTA 678 
Pro lie Pro Ser Ala Gly Thr Ser Ser Thr Leu Thr Gin Arg Val Leu 

120 125 130 

CAG AGT CTG CGC GCG CCG TCG GCG AGT ACT CGA AGA TCC CTG ACG GCC 72 6 
Gin Ser Leu Arg Ala Pro Ser Ala Ser Thr Arg Arg Ser Leu Thr Ala 

135 140 145 

TCG TCA AGT TCT CCG TCG ACG CAG AGA TAAGAGAGAT CTATAACGAG 77 3 
Ser .Ser Ser Ser Pro Ser Thr Gin Arg 

150 155 

GAGTGCCCCG TCGTCACTGA CGTGTCCGTC CCCCTCGACG GCCGCCAGTG GAGCCTCTCG 833 

ATTTTCTCCT TTCCGATGTT CAGAACCGCC TACGTCGCCG TAGCGAACGT CGAGAACAAG 8 93 

GAGATGTCGC TCGACGTTGT CAACGACCTC ATCGAGTGGC TCAACAATCT CGCCGACTGG 953 

CGTTATGTCG TTGACTCTGA ACAGTGGATT AACTTCACCA ATGACACCAC GTACTACGTC 1013 

CGCATCCGCG TTCTACGTCC AAC C TAC GAC GTTCCAGACC CCACAGAGGG CCTTGTTCGC 107 3 

ACAGTCTCAG ACTACCGCCT CAC T TAT AAG GC GAT AAC AT GTGAAGCCAA CAT GCC AAC A 1133 

CTCGTCGACC AAGGCTTTTG GATCGGCGGC CAGTACGCTC TCACCCCGAC TAGCCTACCG 1193 

CAGTACGACG TCAGCGAGGC CTACGCTCTG CACACTTTGA CCTTCGCCAG ACCATCCAGC 1253 

GCCGCTGCAC TCGCGTTTGT GTGGGCAGGT TT GCC AC AGG GTGGCACTGC GCCTGCAGGC 1313 
ACTCCAGCCT GGGAGCAGGC ATCCTCGGGT GGCTACCTCA CCTGGCGCCA CAACGGTACT . 137 3 

ACTTTCCCAG CTGGCTCCGT TAGCTACGTT CTCCCTGAGG GTTTCGCCCT TGAGCGCTAC 14 33 

GACCCGAACG ACGGCTCTTG GACCGACTTC GCTTCCGCAG GAGACACCGT CACTTTCCGG 14 93 

CAGGTCGCCG TCGACGAGGT CGTTGTGACC AACAACCCCG CCGGCGGCGG CAGCGCCCCC 1553 

ACCTTCACCG TGAGAGTGCC CCCTTCAAAC GCTTACACCA ACACCGTGTT TAGGAACACG 1613 

CTCTTAGAGA CTCGACCCTC CTCTCGTAGG CTCGAACTCC CTATGCCACC TGCTGACTTT 167 3 

GGACAGACGG TCGCCAACAA C CCG AAG ATC GAG CAG T CGC TTCTTAAAGA AACACTTGGC 1733 

TGCTATTTGG TCCACTCCAA AATGCGAAAC CCCGTTTTCC AGCTCACGCC AGCCAGCTCC 17 93 

TTTGGCGCCG TTTCCTTCAA CAATCCGGGT TATGAGCGCA CACGCGACCT CCCGGACTAC 1853 

ACTGGCATCC GTGACTCATT C GAC CAG AAC ATGTCCACCG CTGTGGCCCA CTTCCGCTCA 1913 

CTCTCCCACT CCTGCAGTAT CGTCACTAAG ACCTACCAGG GTTGGGAAGG CGTCACGAAC 197 3 

GTCAACACGC CTTTCGGCCA ATTCGCGCAC GCGGGCCTCC TCAAGAATGA GGAGATCCTC 2033 
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TGCCTCGCCG ACGACCTGGC CACCCGTCTC ACAGGTGTCT ACCCCGCCAC TGACAACTTC 2093 

GCGGCCGCCG TTTCTGCCTT CGCCGCGAAC ATGCTGTCCT CCGTGCTGAA GTCGGAGGCA 2153 

ACGTCCTCCA TCATCAAGTC CGTTGGCGAG ACTGCCGTCG GCGCGGCTCA GTCCGGCCTC 2213 

GCGAAGCTAC CCGGACTGCT AATGAGTGTA CCAGGGAAGA TTGCCGCGCG TGTCCGCGCG 2273 

CGCCGAGCGC GCCGCCGCGC CGCTCGTGCC AATTAGTTTG CTCGCTCCTG TTTCGCCGTT 2333 

TCGTAAAACG GCGTGGTCCC GCACATTACG CGTACCCTAA AGACTCTGGT GAGTCCCCGT 2393 

CGTTACACGA CGGGTCTGCC GCGGTTCGAT TCCATTCCCA AGCGGCAAGA AGGACGTAGT 24 53 

TAGCTCTGCG TCCCTCGGGA TACCA 24 78 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



Met 


Ser 


Glu 


His 


Thr 


He 


Ala 


His 


Ser 


He 


Thr 


Leu 


Pro 


Pro 


Gly 


Tyr 


1 








5 










10 










15 




Thr 


Leu 


Ala 


Leu 


He 


Pro 


Pro 


Glu 


Pro 


Glu 


Ala 


Gly 


Trp 


Glu 


Met 


Leu 








20 










25 










30 






Glu 


Trp 


Arg 


His 


Ser 


Asp 


Leu 


Thr 


Thr 


Val 


Ala 


Glu 


Pro 


Val 


Thr 


Phe 






35 










40 










45 








Gly 


Ser 


Ala 


Pro 


Thr 


Pro 


Ser 


Pro 


Ser 


Met 


Val 


Glu 


Glu 


Thr 


Asn 


Gly 




50 










55 










60 










Val 


Gly 


Pro 


Glu 


Gly 


Lys 


Phe 


Leu 


Pro 


Leu 


Thr 


He 


Ser 


Pro 


Leu 


Leu 


65 










70 










75 










80 


His 


Lys 


Thr 


Ser 


Arg 


Lys 


Ala 


Leu 


Thr 


Pro 


Thr 


Pro 


Ser 


Leu 


Ser 


Pro 










85 










90 










95 




Leu 


Thr 


Ser 


Leu 


Ala 


Cys 


Pro 


Asn 


Ser 


Gly 


He 


Gly 


Pro 


Arg 


Glu 


Arg 








100 










105 










110 






Ser 


Thr 


Ser 


Thr 


Pro 


He 


Pro 


Ser 


Ala 


Gly 


Thr 


Ser 


Ser 


Thr 


Leu 


Thr 






115 










120 










125 








Gin 


Arg 


Val 


Leu 


Gin 


Ser 


Leu 


Arg 


Ala 


Pro 


Ser 


Ala 


Ser 


Thr 


Arg 


Arg 




130 










135 










140 










Ser 


Leu 


Thr 


Ala 


Ser 


Ser 


Ser 


Ser 


Pro 


Ser 


Thr 


Gin 


Arg 









145 150 155 



(2) INFORMATION FOR SEQ ID NO: 49: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 366.. 2306 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

GTTTTTCTTT CTTTACCAAG TGTGGTAAAA TTTAAACAAA GAAGAAAACC AGGACCGTAA 60 

CCCGGCCCTT ACACACCTCG AGTCCGTGAC CACCGGATTA TACGTCGCCC ACCACACGGC 120 

GCCTTTTCCG ACCACTCTCG AGAGTCGTTG GGAGTTTCGT CCGTGACCAC CCGGTTGGCA 180 

GTCGACAGAC GCTTCCGGAC CACTAGAACC TCCTCGAGCG ACGCACACAC AG C AC AC AC A 240 

CCGCCTTAGC TGCACCTACG GCAGCGTTGA TAGCGCGGAT TTATGAGCGA GCACACCATC 300 

GCCCACTCCA TCACATTACC ACCCGGTTAC ACCCTTGCCC TAATACCCCC TGAACCTGAA 360 



28 



GCAGG ATG GGA GAT GCT GGA GTG GCG TCA CAG CGA CCT CAC AAC CGT 4 07 

Met Gly Asp Ala Gly Val Ala Ser Gin Arg Pro His Asn Arg 







1 








5 








10 












CGC 


GGA 


ACC 


CGT 


AAC 


GTT 


CGG 


GTC 


AGC 


GCC 


AAC 


ACC 


GTC 


ACC 


GTC 


AAT 


455 


Arg 


Gly 


Thr 


Arg 


Asn 


Val 


Arg 


Val 


Ser 


Ala 


Asn 


Thr 


Val 


Thr 


Val 


Asn 




15 










20 










25 










30 




GGT 


AGA 


AGA 


AAC 


CAA 


CGG 


CGT 


CGG 


ACC 


GGA 


AGG 


CAA 


GTT 


TCT 


CCC 


CCT 


503 


Gly 


Arg 


Arg 


Asn 


Gin 
35 


Arg 


Arg 


Arg 


Thr 


Gly 
40 


Arg 


Gin 


Val 


Ser 


Pro 
45 


Pro 




GAC 


AAT 


TTC 


ACC 


GCT 


GCT 


GCA 


CAA 


GAC 


CTC 


GCG 


CAA 


AGC 


CTT 


GAC 


GCC 


551 


Asp 


Asn 


Phe 


Thr 
50 


Ala 


Ala 


Ala 


Gin 


Asp 
55 


Leu 


Ala 


Gin 


Ser 


Leu 
60 


Asp 


Ala 




AAC 


ACC 


GTC 


ACT 


TTC 


CCC 


GCT 


AAC 


ATC 


TCT 


AGC 


ATG 


CCC 


GAA 


TTC 


CGG 


599 


Asn 


Thr 


Val 
65 


Thr 


Phe 


Pro 


Ala 


Asn 
70 


He 


Ser 


Ser 


Met 


Pro 
75 


Glu 


Phe 


Arg 




AAT 


TGG 


GCC 


AAG 


GGA 


AAG 


ATC 


GAC 


CTC 


GAC 


TCC 


GAT 


TCC 


ATC 


GGC 


TGG 


647 


Asn 


Trp 
80 


Ala 


Lys 


Gly 


Lys 


He 
85 


Asp 


Leu 


Asp 


Ser 


Asp 
90 


Ser 


He 


Gly 


Trp 




TAC 


TTC 


AAG 


TAC 


CTT 


GAC 


CCA 


GCG 


GGT 


GCT 


ACA 


GAG 


TCT 


GCG 


CGC 


GCC 


695 


Tyr 


Phe 


Lys 


Tyr 


Leu 


Asp 


Pro 


Ala 


Gly 


Ala 


Thr 


Glu 


Ser 


Ala 


Arg 


Ala 




95 










100 










105 










110 




GTC 


GGC 


GAG 


TAC 


TCG 


AAG 


ATC 


CCT 


GAC 


GGC 


CTC 


GTC 


AAG 


TTC 


TCC 


GTC 


743 


Val 


Gly 


Glu 


Tyr 


Ser 
115 


Lys 


He 


Pro 


Asp 


Gly 
120 


Leu 


Val 


Lys 


Phe 


Ser 
125 


Val 




GAC 


GCA 


GAG 


ATA 


AGA 


GAG 


ATC 


TAT 


AAC 


GAG 


GAG 


TGC 


CCC 


GTC 


GTC 


ACT 


791 


Asp 


Ala 


Glu 


He 
130 


Arg 


Glu 


He 


Tyr 


Asn 
135 


Glu 


Glu 


Cys 


Pro 


Val 
140 


Val 


Thr 




GAC 


GTG 


TCC 


GTC 


CCC 


CTC 


GAC 


GGC 


CGC 


CAG 


TGG 


AGC 


CTC 


TCG 


ATT 


TTC 


839 


Asp 


Val 


Ser 
145 


Val 


Pro 


Leu 


Asp 


Gly 
150 


Arg 


Gin 


Trp 


Ser 


Leu 
155 


Ser 


He 


Phe 




TCC 


TTT 


CCG 


ATG 


TTC 


AGA 


ACC 


GCC 


TAC 


GTC 


GCC 


GTA 


GCG 


AAC 


GTC 


GAG 


887 


Ser 


Phe 
160 


Pro 


Met 


Phe 


Arg 


Thr 
165 


Ala 


Tyr 


Val 


Ala 


Val 
170 


Ala 


Asn 


Val 


Glu 




AAC 


AAG 


GAG 


ATG 


TCG 


CTC 


GAC 


GTT 


GTC 


AAC 


GAC 


CTC 


ATC 


GAG 


TGG 


CTC 


935 


Asn 


Lys 


Glu 


Met 


Ser 


Leu 


Asp 


Val 


Val 


Asn 


Asp 


Leu 


He 


Glu 


Trp 


Leu 




175 










180 










185 










190 




AAC 


AAT 


CTC 


GCC 


GAC 


TGG 


CGT 


TAT 


GTC 


GTT 


GAC 


TCT 


GAA 


CAG 


TGG 


ATT 


983 


Asn 


Asn 


Leu 


Ala 


Asp 
195 


Trp 


Arg 


Tyr 


Val 


Val 
200 


Asp 


Ser 


Glu 


Gin 


Trp 
205 


He 




AAC 


TTC 


ACC 


AAT 


GAC 


ACC 


ACG 


TAC 


TAC 


GTC 


CGC 


ATC 


CGC 


'GTT 


CTA 


CGT 


1031 


Asn 


Phe 


Thr 


Asn 
210 


Asp 


Thr 


Thr 


Tyr 


Tyr 
215 


Val 


Arg 


He 


Arg 


Val 
220 


Leu 


Arg 




CCA 


ACC 


TAC 


GAC 


GTT 


CCA 


GAC 


CCC 


ACA 


GAG 


GGC 


CTT 


GTT 


CGC 


ACA 


GTC 


1079 


Pro 


Thr 


Tyr 
225 


Asp 


Val 


Pro 


Asp 


Pro 
230 


Thr 


Glu 


Gly 


Leu 


Val 
235 


Arg 


Thr 


Val 




TCA 


GAC 


TAC 


CGC 


CTC 


ACT 


TAT 


AAG 


GCG 


ATA 


ACA 


TGT 


GAA 


GCC 


AAC 


ATG 


1127 


Ser 


Asp 
240 


Tyr 


Arg 


Leu 


Thr 


Tyr 
245 


Lys 


Ala 


He 


Thr 


Cys 
250 


Glu 


Ala 


Asn 


Met 




CCA 


ACA 


CTC 


GTC 


GAC 


CAA 


GGC 


TTT 


TGG 


ATC 


GGC 


GGC 


CAG 


TAC 


GCT 


CTC 


1175 


Pro 


Thr 


Leu 


Val 


Asp 


Gin 


Gly 


Phe 


Trp 


He 


Gly 


Gly 


Gin 


Tyr 


Ala 


Leu 




255 










260 










265 










270 




ACC 


CCG 


ACT 


AGC 


CTA 


CCG 


CAG 


TAC 


GAC 


GTC 


AGC 


GAG 


GCC 


TAC 


GCT 


CTG 


1223 


Thr 


Pro 


Thr 


Ser 


Leu 
275 


Pro 


Gin 


Tyr 


Asp 


Val 
280 


Ser 


Glu 


Ala 


Tyr 


Ala 
285 


Leu 




CAC 


ACT 


TTG 


ACC 


TTC 


GCC 


AGA 


CCA 


TCC 


AGC 


GCC 


GCT 


GCA 


CTC 


GCG 


TTT 


1271 


His 


Thr 


Leu 


Thr 
290 


Phe 


Ala 


Arg 


Pro 


Ser 
295 


Ser 


Ala 


Ala 


Ala 


Leu 
300 


Ala 


Phe 





29 



GTG 
Val 

GCC 
Ala 

GGT 
Gly 
335 
TTC 
Phe 

GCT 
Ala 

GTC 
Val 

ACC 
Thr 

AAC 
Asn 
415 
ATG 
Met 

GAG 
Glu 

AAA 
Lys 

GCC 
Ala 

GAC 
Asp 
495 
GTG 
Val 

ACC 
Thr 

CAA 
Gin 

GCC 
Ala 

AAC 
Asn 
575 
GTG 
Val 



TGG 
Trp 

TGG 
Trp 
320 
ACT 
Thr 

GCC 
Ala 

TCC 
Ser 

GTT 
Val 

GTG 
Val 
400 
ACG 
Thr 

CCA 
Pro 

CAG 
Gin 

ATG 
Met 

GTT 
Val 
480 
TAC 
Tyr 

GCC 
Ala 

TAC 
Tyr 

TTC 
Phe 

GAC 
Asp 
560 
TTC 
Phe 

CTG 
Leu 



GCA GGT 
Ala Gly 
305 

GAG CAG 
Glu Gin 

ACT TTC 
Thr Phe 

CTT GAG 
Leu Glu 



GCA 
Ala 

GTG 
Val 
385 
AGA 
Arg 



GGA 
Gly 
370 
ACC 
Thr 

GTG 
Val 



CTC TTA 
Leu Leu 

CCT GCT 
Pro Ala 



TCG 
Ser 

CGA 
Arg 
465 
TCC 
Ser 



CTT 
Leu 
450 
AAC 
Asn 

TTC 
Phe 



ACT GGC 
Thr Gly 

CAC TTC 
His Phe 



CAG 
Gin 

GCG 
Ala 
545 
GAC 
Asp 



GGT 
Gly 
530 
CAC 
His 

CTG 
Leu 



GCG GCC 
Ala Ala 

AAG TCG 
Lys Ser 



TTG 
Leu 

GCA 
Ala 

CCA 
Pro 

CGC 
Arg 
355 
GAC 
Asp 

AAC 
Asn 

CCC 
Pro 

GAG 
Glu 

GAC 
Asp 
435 
CTT 
Leu 

CCC 
Pro 

AAC 
Asn 

ATC 
He 

CGC 
Arg 
515 
TGG 
Trp 

GCG 
Ala 

GCC 
Ala 

GCC 
Ala 

GAG 
Glu 
595 



CCA CAG 
Pro Gin 

TCC TCG 
Ser Ser 
325 
GCT GGC 
Ala Gly 
340 

TAC GAC 
Tyr Asp 

ACC GTC 
Thr Val 

AAC CCC 
Asn Pro 

CCT TCA 
Pro Ser 
405 
ACT CGA 
Thr Arg 
420 

TTT GGA 
Phe Gly 

AAA GAA 
Lys Glu 

GTT TTC 
Val Phe 



GGT GGC ACT GCG 
Gly Gly Thr Ala 
310 

GGT GGC TAC CTC 
Gly Gly Tyr Leu 



TCC GTT 
Ser Val 

CCG AAC 
Pro Asn 



ACT 
Thr 

GCC 
Ala 
390 
AAC 
Asn 



TTC 
Phe 
375 
GGC 
Gly 

GCT 
Ala 



AAT 
Asn 

CGT 
Arg 
500 
TCA 
Ser 



CCG 
Pro 
485 
GAC 
Asp 

CTC 
Leu 



CCC TCC 
Pro Ser 

CAG ACG 
Gin Thr 

ACA CTT 
Thr Leu 
455 
CAG CTC 
Gin Leu 
470 

GGT TAT 
Gly Tyr 



AGC TAC 
Ser Tyr 
345 
GAC GGC 
Asp Gly 
360 

CGG CAG 
Arg Gin 

GGC GGC 
Gly Gly 

TAC ACC 
Tyr Thr 

TCT CGT 
Ser Arg 
425 
GTC GCC 
Val Ala 
440 

GGC TGC 
Gly Cys 

ACG CCA 
Thr Pro 

GAG CGC 
Glu Arg 



GAA GGC 
Glu Gly 

GGC CTC 
Gly Leu 

ACC CGT 
Thr Arg 
565 
GTT TCT 
Val Ser 
580 

GCA ACG 
Ala Thr 



TCA 
Ser 

TCC 
Ser 

GTC 
Val 

CTC 
Leu 
550 
CTC 
Leu 



TTC GAC 
Phe Asp 



CAC 
His 

ACG 
Thr 
535 
AAG 
Lys 



TCC 
Ser 
520 
AAC 
Asn 

AAT 
Asn 



CAG 
Gin 
505 
TGC 
Cys 

GTC 
Val 

GAG 
Glu 



ACA GGT GTC 
Thr Gly Val 



GCC TTC GCC GCG 
Ala Phe Ala Ala 
585 

TCC TCC ATC ATC 
Ser Ser He He 
600 



CCT 
Pro 

ACC 
Thr 
330 
GTT 
Val 

TCT 
Ser 

GTC 
Val 

AGC 
Ser 

AAC 
Asn 
410 
AGG 
Arg 

AAC 
Asn 

TAT 
Tyr 

GCC 
Ala 

ACA 
Thr 
490 
AAC 
Asn 

AGT 
Ser 

AAC 
Asn 

GAG 
Glu 

TAC 
Tyr 
570 
AAC 
Asn 

AAG 
Lys 



GCA GGC ACT CCA 
Ala Gly Thr Pro 
315 

TGG CGC CAC AAC 
Trp Arg His Asn 



CTC 
Leu 

TGG 
Trp 

GCC 
Ala 

GCC 
Ala 
395 
ACC 
Thr 



CCT 
Pro 

ACC 
Thr 

GTC 
Val 
380 
CCC 
Pro 



GAG 
Glu 

GAC 
Asp 
365 
GAC 
Asp 



GGT 
Gly 
350 
TTC 
Phe 

GAG 
Glu 



ACC TTC 
Thr Phe 



GTG TTT AGG 
Val Phe Arg 



CTC GAA CTC 
Leu Glu Leu 



AAC CCG 
Asn Pro 



TTG 
Leu 

AGC 
Ser 
475 
CGC 
Arg 



GTC 
Val 
4 60 
TCC 
Ser 

GAC 
Asp 



ATG TCC 
Met Ser 

ATC GTC 
He Val 



ACG 
Thr 

ATC 
He 
555 
CCC 
Pro 



CCT 
Pro 
540 
CTC 
Leu 

GCC 
Ala 



AAG 
Lys 
445 
CAC 
His 

TTT 
Phe 

CTC 
Leu 

ACC 
Thr 

ACT 
Thr 
525 
TTC 
Phe 



CCT 
Pro 
430 
ATC 
He 

TCC 
Ser 

GGC 
Gly 

CCG 
Pro 

GCT 
Ala 
510 
AAG 
Lys 

GGC 
Gly 



TGC CTC 
Cys Leu 

ACT GAC 
Thr Asp 



ATG CTG TCC TCC 
Met Leu Ser Ser 
590 

TCC GTT GGC GAG 
Ser Val Gly Glu 
605 



1319 



1367 



1415 



1463 



1511 



1559 



1607 



1655 



1703 



1751 



1799 



1847 



1895 



1943 



1991 



2039 



2087 



2135 



2183 



30 



1 



ACT GCC GTC GGC GCG GCT CAG TCC GGC CTC GCG AAG CTA CCC GGA CTG 2231 
Thr Ala Val Gly Ala Ala Gin Ser Gly Leu Ala Lys Leu Pro Gly Leu 

610 615 620 

CTA ATG AGT GTA CCA GGG AAG ATT GCC GCG CGT GTC CGC GCG CGC CGA 227 9 

Leu Met Ser Val Pro Gly Lys lie Ala Ala Arg Val Arg Ala Arg Arg 

625 630 635 

GCG CGC CGC CGC GCC GCT CGT GCC AAT -TAGTTTGCTC GCTCCTGTTT 2326 
Ala Arg Arg Arg Ala Ala Arg Ala Asn 

640 645 
CGCCGTTTCG TAAAACGGCG TGGTCCCGCA CATTACGCGT ACCCTAAAGA CTCTGGTGAG 238 6 
TCCCCGTCGT TACACGACGG GTCTGCCGCG GTTCGATTCC ATTCCCAAGC GGCAAGAAGG 24 4 6 
AC GT AGT TAG CTCTGCGTCC CTCGGGATAC CA 24 78 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 647 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



Met 


Gly 


Asp 


Ala 


Gly 


Val 


Ala 


Ser 


Gin 


Arg 


Pro 


His 


Asn 


Arg 


Arg 


Gly 


1 








5 










10 










15 




Thr 


Arg 


Asn 


Val 


Arg 


Val 


Ser 


Ala 


Asn 


Thr 


Val 


Thr 


Val 


Asn 


Gly Arg 








20 










25 










30 






Arg 


Asn 


Gin 


Arg 


Arg 


Arg 


Thr 


Gly 


Arg 


Gin 


Val 


Ser 


Pro 


Pro 


Asp 


Asn 






35 










40 










45 








Phe 


Thr 


Ala 


Ala 


Ala 


Gin 


Asp 


Leu 


Ala 


Gin 


Ser 


Leu 


Asp 


Ala 


Asn 


Thr 




50 










55 










60 










Val 


Thr 


Phe 


Pro 


Ala 


Asn 


He 


Ser 


Ser 


Met 


Pro 


Glu 


Phe 


Arg 


Asn 


Trp 


65 










70 










75 










80 


Ala 


Lys 


Gly 


Lys 


He 


Asp 


Leu 


Asp 


Ser 


Asp 


Ser 


He 


Gly 


Trp 


Tyr 


Phe 










85 










90 










95 




Lys 


Tyr 


Leu 


Asp 


Pro 


Ala 


Gly 


Ala 


Thr 


Glu 


Ser 


Ala 


Arg 


Ala 


Val 


Gly 








100 










105 










110 






Glu 


Tyr 


Ser 


Lys 


He 


Pro 


Asp 


Gly 


Leu 


Val 


Lys 


Phe 


Ser 


Val 


Asp 


Ala 






115 










120 










125 








Glu 


He 


Arg 


Glu 


He 


Tyr 


Asn 


Glu 


Glu 


Cys 


Pro 


Val 


Val 


Thr 


Asp 


Val 




130 










135 










140 










Ser 


Val 


Pro 


Leu 


Asp 


Gly 


Arg 


Gin 


Trp 


Ser 


Leu 


Ser 


He 


Phe 


Ser 


Phe 


145 










150 










155 










160 


Pro 


Met 


Phe 


Arg 


Thr 


Ala 


Tyr 


Val 


Ala 


Val 


Ala 


Asn 


Val 


Glu 


Asn 


Lys 










165 










170 










175 




Glu 


Met 


Ser 


Leu 


Asp 


Val 


Val 


Asn 


Asp 


Leu 


He 


Glu 


Trp 


Leu 


Asn 


Asn 








180 










185 










190 






Leu 


Ala 


Asp 


Trp 


Arg 


Tyr 


Val 


Val 


Asp 


Ser 


Glu 


Gin 


Trp 


He 


Asn 


Phe 






195 










200 










205 








Thr 


Asn 


Asp 


Thr 


Thr 


Tyr 


Tyr 


Val 


Arg 


He 


Arg 


Val 


Leu 


Arg 


Pro 


Thr 




210 










215 










220 










Tyr 


Asp 


Val 


Pro 


Asp 


Pro 


Thr 


Glu 


Gly 


Leu 


Val 


Arg 


Thr 


Val 


Ser 


Asp 


225 










230 










235 










240 


Tyr 


Arg 


Leu 


Thr 


Tyr 


Lys 


Ala 


He 


Thr 


Cys 


Glu 


Ala 


Asn 


Met 


Pro 


Thr 










245 










250 










255 




Leu 


Val 


Asp 


Gin 


Gly 


Phe 


Trp 


He 


Gly 


Gly 


Gin 


Tyr 


Ala 


Leu 


Thr 


Pro 








260 










265 










270 






Thr 


Ser 


Leu 


Pro 


Gin 


Tyr 


Asp 


Val 


Ser 


Glu 


Ala 


Tyr 


Ala 


Leu 


His 


Thr 






275 










280 










285 









31 



Leu 


Thr 


Phe 


Ala 


Arg 


Pro 


Ser 


Ser 


Ala 


Ala 


Ala 


Leu 


Ala 


Phe 


Val 


Trp 




290 










295 










300 










Ala 


Gly 


Leu 


Pro 


Gin 


Gly 


Gly 


Thr 


Ala 


Pro 


Ala 


Gly 


Thr 


Pro 


Ala 


Trp 


305 










310 










315 










320 


Glu 


Gin 


Ala 


Ser 


Ser 


Gly 


Gly 


Tyr 


Leu 


Thr 


Trp 


Arg 


His 


Asn 


Gly 


Thr 










325 










330 










335 




Thr 


Phe 


Pro 


Ala 


Gly 


Ser 


Val 


Ser 


Tyr 


Val 


Leu 


Pro 


Glu 


Gly 


Phe 


Ala 








340 










345 










350 






Leu 


Glu 


Arg 


Tyr 


Asp 


Pro 


Asn 


Asp 


Gly 


Ser 


Trp 


Thr 


Asp 


Phe 


Ala 


Ser 






355 










360 










365 








Ala 


Gly 


Asp 


Thr 


Val 


Thr 


Phe 


Arg 


Gin 


Val 


Ala 


Val 


Asp 


Glu 


Val 


Val 




370 










375 










380 










Val 


Thr 


Asn 


Asn 


Pro 


Ala 


Gly 


Gly 


Gly 


Ser 


Ala 


Pro 


Thr 


Phe 


Thr 


Val 


385 










390 










395 










400 


Arg 


Val 


Pro 


Pro 


Ser 


Asn 


Ala 


Tyr 


Thr 


Asn 


Thr 


Val 


Phe 


Arg 


Asn 


Thr 










405 










410 










415 




Leu 


Leu 


Glu 


Thr 


Arg 


Pro 


Ser 


Ser 


Arg 


Arg 


Leu 


Glu 


Leu 


Pro 


Met 


Pro 








420 










425 










430 






Pro 


Ala 


Asp 


Phe 


Gly 


Gin 


Thr 


Val 


Ala 


Asn 


Asn 


Pro 


Lys 


He 


Glu 


Gin 






435 










440 










445 








Ser 


Leu 


Leu 


Lys 


Glu 


Thr 


Leu 


Gly 


Cys 


Tyr 


Leu 


Val 


His 


Ser 


Lys 


Met 




450 










455 










460 










Arg 


Asn 


Pro 


Val 


Phe 


Gin 


Leu 


Thr 


Pro 


Ala 


Ser 


Ser 


Phe 


Gly 


Ala 


Val 


465 










470 










475 










480 


Ser 


Phe 


Asn 


Asn 


Pro 


Gly 


Tyr 


Glu 


Arg 


Thr 


Arg 


Asp 


Leu 


Pro 


Asp 


Tyr 










485 










490 










495 




Thr 


Gly 


He 


Arg 


Asp 


Ser 


Phe 


Asp 


Gin 


Asn 


Met 


Ser 


Thr 


Ala 


Val 


Ala 








500 










505 










510 






His 


Phe 


Arg 


Ser 


Leu 


Ser 


His 


Ser 


Cys 


Ser 


He 


Val 


Thr 


Lys 


Thr 


Tyr 






515 










520 










525 








Gin 


Gly 


Trp 


Glu 


Gly 


Val 


Thr 


Asn 


Val 


Asn 


Thr 


Pro 


Phe 


Gly 


Gin 


Phe 




530 










535 










540 










Ala 


His 


Ala 


Gly 


Leu 


Leu 


Lys 


Asn 


Glu 


Glu 


He 


Leu 


Cys 


Leu 


Ala 


Asp 


545 










550 










555 










560 


Asp 


Leu 


Ala 


Thr 


Arg 


Leu 


Thr 


Gly 


Val 


Tyr 


Pro 


Ala 


Thr 


Asp 


Asn 


Phe 










565 










570 










575 




Ala 


Ala 


Ala 


Val 


Ser 


Ala 


Phe 


Ala 


Ala 


Asn 


Met 


Leu 


Ser 


Ser 


Val 


Leu 








580 










585 










590 






Lys 


Ser 


Glu 


Ala 


Thr 


Ser 


Ser 


He 


He 


Lys 


Ser 


Val 


Gly 


Glu 


Thr 


Ala 






595 










600 










605 








Val 


Gly 


Ala 


Ala 


Gin 


Ser 


Gly 


Leu 


Ala 


Lys 


Leu 


Pro 


Gly 


Leu 


Leu 


Met 




610 










615 










620 










Ser 


Val 


Pro 


Gly 


Lys 


He 


Ala 


Ala 


Arg 


Val 


Arg 


Ala 


Arg 


Arg 


Ala 


Arg 


625 










630 










635 










640 


Arg 


Arg 


Ala 


Ala 


Arg 


Ala 


Asn 





















645 



(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 



32 



(B) LOCATION: 283. .2307 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GTTTTTCTTT CTTTACCAAG TGTGGTAAAA TTTAAACAAA GAAGAAAACC AGGACCGTAA 60 
CCCGGCCCTT ACACACCTCG AGTCCGTGAC CACCGGATTA TACGTCGCCC ACCACACGGC 120 
GCCTTTTCCG ACCACTCTCG AGAGTCGTTG GGAGTTTCGT CCGTGACCAC CCGGTTGGCA 180 
GTCGACAGAC GCTTCCGGAC CACTAGAACC TCCTCGAGCG ACGCACACAC AGCACACACA 240 
CCGCCTTAGC TGCACCTACG GCAGCGTTGA TAGCGCGGAT TT ATG AGC GAG CAC 2 94 

Met. Ser Glu His 
1 

ACC ATC GCC CAC TCC ATC ACA TTA'CCA CCC GGT TAC ACC CTT GCC CTA 342 
Thr lie Ala His Ser lie Thr Leu Pro Pro Gly Tyr Thr Leu Ala Leu 
5 10 15 20 

ATA CCC CCT GAA CCT GAA GCA GGA TGG GAG ATG CTG GAG TGG CGT CAC 390 
lie Pro Pro Glu Pro Glu Ala Gly Trp Glu Met Leu Glu Trp Arg His 

25 30 35 

AGC GAC CTC ACA ACC GTC GCG GAA CCC GTA ACG TTC GGG TCA GCG CCA 4 38 

Ser Asp Leu Thr Thr Val Ala Glu Pro Val Thr Phe Gly Ser Ala Pro 

40 45 50 

ACA CCG TCA CCG TCA ATG GTA GAA GAA ACC AAC GGC GTC GGA CCG GAA 4 86 

Thr Pro Ser Pro Ser Met Val Glu Glu Thr Asn Gly Val Gly Pro Glu 

55 60 65 

GGC AAG TTT CTC CCC CTG ACA ATT TCA CCG CTG CTG CAC AAG ACC TCG 534 
Gly Lys Phe Leu Pro Leu Thr lie Ser Pro Leu Leu His Lys Thr Ser 

70 75 80 

CGC AAA GCC TTG ACG CCA ACA CCG TCA CTT TCC CCC GCT AAC ATC TCT 582 
Arg Lys Ala Leu Thr Pro Thr Pro Ser Leu Ser Pro Ala Asn lie Ser 
85 90 95 100 

AGC ATG CCC GAA TTC CGG AAT TGG GCC AAG GGA AAG ATC GAC CTC GAC 630 
Ser Met Pro Glu Phe Arg Asn Trp Ala Lys Gly Lys lie Asp Leu Asp 

105 110 115 

TCC GAT TCC ATC GGC TGG TAC TTC AAG TAC CTT GAC CCA GCG GGT GCT 678 
Ser Asp Ser lie Gly Trp Tyr Phe Lys Tyr Leu Asp Pro Ala Gly Ala 

120 125 130 

ACA GAG TCT GCG CGC GCC GTC GGC GAG TAC TCG AAG ATC CCT GAC GGC 72 6 

Thr Glu Ser Ala Arg Ala Val Gly Glu Tyr Ser Lys lie Pro Asp Gly 

135 140 145 

CTC GTC AAG TTC TCC GTC GAC GCA GAG ATA AGA GAG ATC TAT AAC GAG 77 4 

Leu Val Lys Phe Ser Val Asp Ala Glu lie Arg Glu lie Tyr Asn Glu 

150 155 160 

GAG TGC CCC GTC GTC ACT GAC GTG TCC GTC CCC CTC GAC GGC CGC CAG 822 
Glu Cys Pro Val Val Thr Asp Val Ser Val Pro Leu Asp Gly Arg Gin 
165 170 175 180 

TGG AGC CTC TCG ATT TTC TCC TTT CCG ATG TTC AGA ACC GCC TAC GTC 870 
Trp Ser Leu Ser lie Phe Ser Phe Pro Met Phe Arg Thr Ala Tyr Val 

185 190 195 

GCC GTA GCG AAC GTC GAG AAC AAG GAG ATG TCG CTC GAC GTT GTC AAC 918 
Ala Val Ala Asn Val Glu Asn Lys Glu Met Ser Leu Asp Val Val Asn 

200 205 210 

GAC CTC ATC GAG TGG CTC AAC AAT CTC GCC GAC TGG CGT TAT GTC GTT 966 
Asp Leu lie Glu Trp Leu Asn Asn Leu Ala Asp Trp Arg Tyr Val Val 

215 220 225 

GAC TCT GAA CAG TGG ATT AAC TTC ACC AAT GAC ACC ACG TAC TAC GTC 1014 
Asp Ser Glu Gin Trp lie Asn Phe Thr Asn Asp Thr Thr Tyr Tyr Val 

230 235 240 

CGC ATC CGC GTT CTA CGT CCA ACC TAC GAC GTT CCA GAC CCC ACA GAG 1062 
Arg lie Arg Val Leu Arg Pro Thr Tyr Asp Val Pro Asp Pro Thr Glu 
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245 
GGC 
Gly- 

ACA 
Thr 

GGC 
Gly 

AGC 
Ser 

GCC 
Ala 
325 
GCG 
Ala 

CTC 
Leu 

TAC 
Tyr 

GGC 
Gly 

CAG 
Gin 
405 
GGC 
Gly 

ACC 
Thr 

CGT 
Arg 

GCC 
Ala 

TGC 
Cys 
485 
CCA 
Pro 

CGC 
Arg 

CAG 
Gin 

TGC 
Cys 



CTT 
Leu 

TGT 
Cys 

GGC 
Gly 

GAG 
Glu 
310 
GCT 
Ala 

CCT 
Pro 

ACC 
Thr 

GTT 
Val 

TCT 
Ser 
390 
GTC 
Val 

AGC 
Ser 

AAC 
Asn 

AGG 
Arg 

AAC 
Asn 
470 
TAT 
Tyr 

GCC 
Ala 

ACA 
Thr 

AAC 
Asn 

AGT 
Ser 



GTT CGC 
Val Arg 



GAA 
Glu 

CAG 
Gin 
295 
GCC 
Ala 



GCC 
Ala 
280 
TAC 
Tyr 

TAC 
Tyr 



250 
ACA GTC 
Thr Val 
265 

AAC ATG 
Asn Met 

GCT CTC 
Ala Leu 

GCT CTG 
Ala Leu 



GCA CTC 
Ala Leu 

GCA GGC 
Ala Gly 

TGG CGC 
Trp Arg 
360 
CTC CCT 
Leu Pro 
375 

TGG ACC 
Trp Thr 

GCC GTC 
Ala Val 

GCC CCC 
Ala Pro 



GCG 
Ala 

ACT 
Thr 
345 
CAC 
His 



TTT 
Phe 
330 
CCA 
Pro 

AAC 
Asn 



ACC. 
Thr 

CTC 
Leu 
455 
AAC 
Asn 



GTG 
Val 
440 
GAA 
Glu 

CCG 
Pro 



GAG GGT 
Glu Gly 

GAC TTC 
Asp Phe 

GAC GAG 
Asp Glu 
410 
ACC TTC 
Thr Phe 
425 

TTT AGG 
Phe Arg 

CTC CCT 
Leu Pro 

AAG ATC 
Lys lie 



TTG GTC 
Leu Val 

AGC TCC 
Ser Ser 



CGC 
Arg 

ATG 
Met 
535 
ATC 
He 



GAC 
Asp 
520 
TCC 
Ser 

GTC 
Val 



CAC 
His 

TTT 
Phe 
505 
CTC 
Leu 



TCC 
Ser 
490 
GGC 
Gly 

CCG 
Pro 



ACC GCT 
Thr Ala 

ACT AAG 
Thr Lys 



TCA 
Ser 

CCA 
Pro 

ACC 
Thr 

CAC 
His 
315 
GTG 
Val 

GCC 
Ala 

GGT 
Gly 

TTC 
Phe 

GCT 
Ala 
395 
GTC 
Val 

ACC 
Thr 

AAC 
Asn 

ATG 
Met 

GAG 
Glu 
475 
AAA 
Lys 

GCC 
Ala 

GAC 
Asp 

GTG 
Val 

ACC 
Thr 



GAC 
Asp 

ACA 
Thr 

CCG 
Pro 
300 
ACT 
Thr 

TGG 
Trp 

TGG 
Trp 

ACT 
Thr 

GCC 
Ala 
380 
TCC 
Ser 

GTT 
Val 

GTG 
Val 

ACG 
Thr 

CCA 
Pro 
460 
CAG 
Gin 

ATG 
Met 

GTT 
Val 

TAC 
Tyr 

GCC 
Ala 
540 
TAC 
Tyr 



TAC 
Tyr 

CTC 
Leu 
285 
ACT 
Thr 

TTG 
Leu 

GCA 
Ala 

GAG 
Glu 

ACT 
Thr 
365 
CTT 
Leu 

GCA 
Ala 

GTG 
Val 

AGA 
Arg 

CTC 
Leu 
445 
CCT 
Pro 

TCG 
Ser 

CGA 
Arg 

TCC 
Ser 

ACT 
Thr 
525 
CAC 
His 

CAG 
Gin 



255 
CGC CTC 
Arg Leu 
270 

GTC GAC 
Val Asp 

AGC CTA 
Ser Leu 

ACC TTC 
Thr Phe 



GGT 
Gly 

CAG 
Gin 
350 
TTC 
Phe 



TTG 
Leu 
335 
GCA 
Ala 

CCA 
Pro 



ACT 
Thr 

CAA 
Gin 

CCG 
Pro 

GCC 
Ala 
320 
CCA 
Pro 



TAT AAG GCG 
Tyr Lys Ala 
275 

GGC TTT TGG 
Gly Phe Trp 

290 
CAG TAC GAC 
Gin Tyr Asp 
305 

AGA CCA TCC 
Arg Pro Ser 

CAG GGT GGC 
Gin Gly Gly 



TCC TCG 
Ser Ser 

GCT GGC 
Ala Gly 



GAG CGC 
Glu Arg 

GGA GAC 
Gly Asp 



ACC 
Thr 

GTG 
Val 
430 
TTA 
Leu 



AAC 
Asn 
415 
CCC 
Pro 

GAG 
Glu 



GCT GAC 
Ala Asp 

CTT CTT 
Leu Leu 

AAC CCC 
Asn Pro 
495 
TTC AAC 
Phe Asn 
510 

GGC ATC 
Gly He 

TTC CGC 
Phe Arg 

GGT TGG 
Gly Trp 



TAC 
Tyr 

ACC 
Thr 
400 
AAC 
Asn 

CCT 
Pro 

ACT 
Thr 

TTT 
Phe 

AAA 
Lys 
480 
GTT 
Val 



GAC 
Asp 
385 
GTC 
Val 

CCC 
Pro 

TCA 
Ser 

CGA 
Arg 

GGA 
Gly 
465 
GAA 
Glu 



GGT GGC 
Gly Gly 
355 
TCC GTT 
Ser Val 
370 

CCG AAC 
Pro Asn 

ACT TTC 
Thr Phe 

GCC GGC 
Ala Gly 

AAC GCT 
Asn Ala 
435 
CCC TCC 
Pro Ser 
450 

CAG ACG 
Gin Thr 

ACA CTT 
Thr Leu 



TTC CAG CTC 
Phe Gin Leu 



AAT CCG GGT 
Asn Pro Gly 

CGT GAC TCA 
Arg Asp Ser 
530 

TCA CTC TCC 
Ser Leu Ser 
545 

GAA GGC GTC 
Glu Gly Val 



TAT 
Tyr 
515 
TTC 
Phe 

CAC 
His 

ACG 
Thr 



260 
ATA 
He 

ATC 
He 

GTC 
Val 

AGC 
Ser 

ACT 
Thr 
340 
TAC 
Tyr 

AGC 
Ser 

GAC 
Asp 

CGG 
Arg 

GGC 
Gly 
420 
TAC 
Tyr 

TCT 
Ser 

GTC 
Val 

GGC 
Gly 

ACG 
Thr 
500 
GAG 
Glu 

GAC 
Asp 

TCC 
Ser 

AAC 
Asn 



1110 



1158 



1206 



1254 



1302 



1350 



1398 



1446 



1494 



1542 



1590 



1638 



1686 



1734 



1782 



1830 



1878 



1926 



1974 
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550 555 560 

GTC AAC ACG CCT TTC GGC CAA TTC GCG CAC GCG GGC CTC CTC AAG AAT 2022 
Val Asn Thr Pro Phe Gly Gin Phe Ala His Ala Gly Leu Leu Lys Asn 
565 570 575 580 

GAG GAG ATC CTC TGC CTC GCC GAC GAC CTG GCC ACC CGT CTC ACA GGT 2070 
Glu Glu lie Leu Cys Leu Ala Asp Asp Leu Ala Thr Arg Leu Thr Gly 

585 590 595 

GTC TAC CCC GCC ACT GAC AAC TTC GCG GCC GCC GTT TCT GCC TTC GCC 2118 
Val Tyr Pro Ala Thr Asp Asn Phe Ala Ala Ala Val Ser Ala Phe Ala 

600 605 610 

GCG AAC ATG CTG TCC TCC GTG CTG AAG TCG GAG GCA ACG TCC TCC ATC 2166 
Ala Asn Met Leu Ser Ser Val Leu Lys Ser Glu Ala Thr Ser Ser lie 

615 620 625 

ATC AAG TCC GTT GGC GAG ACT GCC GTC GGC GCG GCT CAG TCC GGC CTC 2214 
He Lys Ser Val Gly Glu Thr Ala Val Gly Ala Ala Gin Ser Gly Leu 

630 635 640 

GCG AAG CTA CCC GGA CTG CTA ATG AGT GTA CCA GGG AAG ATT GCC GCG 2262 
Ala Lys Leu Pro Gly Leu Leu Met Ser Val Pro Gly Lys He Ala Ala 
645 650 655 660 

CGT GTC CGC GCG CGC CGA GCG CGC CGC CGC GCC GCT CGT GCC AAT 2307 
Arg Val Arg Ala Arg Arg Ala Arg Arg Arg Ala Ala Arg Ala Asn 
665 670 675 

TAGTTTGCTC GCTCCTGTTT CGCCGTTTCG TAAAACGGCG TGGTCCCGCA CAT TAC G CGT 2367 
ACCCTAAAGA CTCTGGTGAG TCCCCGTCGT T AC AC GAC GG GTCTGCCGCG GTTCGATTCC 2427 
ATTCCCAAGC GGCAAGAAGG ACGTAGTTAG CTCTGCGTCC CTCGGGATAC CA 24 7 9 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Ser Glu His Thr He Ala His Ser He Thr Leu Pro Pro Gly Tyr 

15 10 15 

Thr Leu Ala Leu He Pro Pro Glu Pro Glu Ala Gly Trp Glu Met Leu 

20 25 30 

Glu Trp Arg His Ser Asp Leu Thr Thr Val Ala Glu Pro Val Thr Phe 

35 40 45 

Gly Ser Ala Pro Thr Pro Ser Pro Ser Met Val Glu Glu Thr Asn Gly 

50 55 60 

Val Gly Pro Glu Gly Lys Phe Leu Pro Leu Thr He Ser Pro Leu Leu 
65 70 75 80 

His Lys Thr Ser Arg Lys Ala Leu Thr Pro Thr Pro Ser Leu Ser Pro 

85 90 95 

Ala Asn He Ser Ser Met Pro Glu Phe Arg Asn Trp Ala Lys Gly Lys 

100 105 110 

He Asp Leu Asp Ser Asp Ser He Gly Trp Tyr Phe Lys Tyr Leu Asp 

115 120 125 

Pro Ala Gly Ala Thr Glu Ser Ala Arg Ala Val Gly Glu Tyr Ser Lys 

130 135 140 

He Pro Asp Gly Leu Val Lys Phe Ser Val Asp Ala Glu lie Arg Glu 
145 150 155 160 

He Tyr Asn Glu Glu Cys Pro Val Val Thr Asp Val Ser Val Pro Leu 

165 170 175 

Asp Gly Arg Gin Trp Ser Leu Ser He Phe Ser Phe Pro Met Phe Arg 
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180 










185 










190 






Thr 


Ala 


Tyr 


Val 


Ala 


Val 


Ala 


Asn 


Val 


Glu 


Asn 


Lys 


Glu 


Met 


Ser 


Leu 






195 










200 










205 








Asp 


Val 


Val 


Asn 


Asp 


Leu 


He 


Glu 


Trp 


Leu 


Asn 


Asn 


Leu 


Ala 


Asp 


Trp 




210 










215 










220 










Arg 


Tyr 


Val 


Val 


Asp 


Ser 


Glu 


Gin 


Trp 


He 


Asn 


Phe 


Thr 


Asn 


Asp 


Thr 


225 










230 










235 










240 


Thr 


Tyr 


Tyr 


Val 


Arg 


He 


Arg 


Val 


Leu 


Arg 


Pro 


Thr 


Tyr 


Asp 


Val 


Pro 










245 










250 










255 




Asp 


Pro 


Thr 


Glu 


Gly 


Leu 


Val 


Arg 


Thr 


Val 


Ser 


Asp 


Tyr 


Arg 


Leu 


Thr 








260 










265 










270 






Tyr 


Lys 


Ala 


He 


Thr 


Cys 


Glu 


Ala 


Asn 


Met 


Pro 


Thr 


Leu 


Val 


Asp 


Gin 






275 










280 










285 








Gly 


Phe 


Trp 


He 


Gly 


Gly 


Gin 


Tyr 


Ala 


Leu 


Thr 


Pro 


Thr 


Ser 


Leu 


Pro 




290 










295 










300 










Gin 


Tyr 


Asp 


Val 


Ser 


Glu 


Ala 


Tyr 


Ala 


Leu 


His 


Thr 


Leu 


Thr 


Phe 


Ala 


305 










310 










315 










320 


Arg 


Pro 


Ser 


Ser 


Ala 


Ala 


Ala 


Leu 


Ala 


Phe 


Val 


Trp 


Ala 


Gly 


Leu 


Pro 










325 










330 










335 




Gin 


Gly 


Gly 


Thr 


Ala 


Pro 


Ala 


Gly 


Thr 


Pro 


Ala 


Trp 


Glu 


Gin 


Ala 


Ser 








340 










345 










350 






Ser 


Gly 


Gly 


Tyr 


Leu 


Thr 


Trp 


Arg 


His 


Asn 


Gly 


Thr 


Thr 


Phe 


Pro 


Ala 






355 










360 










365 








Gly 


Ser 


Val 


Ser 


Tyr 


Val 


Leu 


Pro 


Glu 


Gly 


Phe 


Ala 


Leu 


Glu 


Arg 


Tyr 




370 










375 










380 










Asp 


Pro 


Asn 


Asp 


Gly 


Ser 


Trp 


Thr 


Asp 


Phe 


Ala 


Ser 


Ala 


Gly 


Asp 


Thr 


385 










390 










395 










400 


Val 


Thr 


Phe 


Arg 


Gin 


Val 


Ala 


Val 


Asp 


Glu 


Val 


Val 


Val 


Thr 


Asn 


Asn 










405 










410 










415 




Pro 


Ala 


Gly 


Gly 


Gly 


Ser 


Ala 


Pro 


Thr 


Phe 


Thr 


Val 


Arg 


Val 


Pro 


Pro 








420 










425 










430 






Ser 


Asn 


Ala 


Tyr 


Thr 


Asn 


Thr 


Val 


Phe 


Arg 


Asn 


Thr 


Leu 


Leu 


Glu 


Thr 






435 










440 










445 








Arg 


Pro 


Ser 


Ser 


Arg 


Arg 


Leu 


Glu 


Leu 


Pro 


Met 


Pro 


Pro 


Ala 


Asp 


Phe 




450 










455 










460 










Gly 


Gin 


Thr 


Val 


Ala 


Asn 


Asn 


Pro 


Lys 


He 


Glu 


Gin 


Ser 


Leu 


Leu 


Lys 


465 










470 










475 










480 


Glu 


Thr 


Leu 


Gly 


Cys 


Tyr 


Leu 


Val 


His 


Ser 


Lys 


Met 


Arg 


Asn 


Pro 


Val 










485 










490 










495 




Phe 


Gin 


Leu 


Thr 


Pro 


Ala 


Ser 


Ser 


Phe 


Gly 


Ala 


Val 


Ser 


Phe 


Asn 


Asn 








500 










505 










510 






Pro 


Gly 


Tyr 


Glu 


Arg 


Thr 


Arg 


Asp 


Leu 


Pro 


Asp 


Tyr 


Thr 


Gly 


He 


Arg 






515 










520 










525 








Asp 


Ser 


Phe 


Asp 


Gin 


Asn 


Met 


Ser 


Thr 


Ala 


Val 


Ala 


His 


Phe 


Arg 


Ser 




530 


• 








535 










540 










Leu 


Ser 


His 


Ser 


Cys 


Ser 


He 


Val 


Thr 


Lys 


Thr 


Tyr 


Gin 


Gly 


Trp 


Glu 


545 










550 










555 










560 


Gly 


Val 


Thr 


Asn 


Val 


Asn 


Thr 


Pro 


Phe 


Gly 


Gin 


Phe 


Ala 


His 


Ala 


Gly 










565 










570 










575 




Leu 


Leu 


Lys 


Asn 


Glu 


Glu 


He 


Leu 


Cys 


Leu 


Ala 


Asp 


Asp 


Leu 


Ala 


Thr 








580 










585 










590 






Arg 


Leu 


Thr 


Gly 


Val 


Tyr 


Pro 


Ala 


Thr 


Asp 


Asn 


Phe 


Ala 


Ala 


Ala 


Val 






595 










600 










605 








Ser 


Ala 


Phe 


Ala 


Ala 


Asn 


Met 


Leu 


Ser 


Ser 


Val 


Leu 


Lys 


Ser 


Glu 


Ala 




610 










615 










620 










Thr 


Ser 


Ser 


He 


He 


Lys 


Ser 


Val 


Gly 


Glu 


Thr 


Ala 


Val 


Gly 


Ala 


Ala 


625 










630 










635 










640 
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Gin Ser Gly Leu Ala Lys Leu Pro Gly Leu Leu Met Ser Val Pro Gly 

645 650 655 

Lys lie Ala Ala Arg Val Arg Ala Arg Arg Ala Arg Arg Arg Ala Ala 
660 665 670 

Arg Ala Asn 
675 

INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 
STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GGGGATCCAC AGTTCTGCCT CCCCCGGACG GTAAATATAG GGGAACCATG GTCTAGAGG 59 
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